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INTRODUCTION 



If we define a field in terms of scholarly endeavor language assessment is a fairly recent development Concepts of 
measurement remained in isolation from the study of language acquisition through the nineteenth century. Thomas 
Simpson's innovations of observational, error during the eighteenth century, for instance, remained confined to the 
study of mathematics (only a few of his contemporaries, particularly the Marquis de La Place, and the Comte de La 
Grange ventured to apply these mathematical concepts to astronomy). So fared the probability and antecedental 
formulations to the "normal curve*, as well, by another of Simpson's contemporaries, the Huguenot mathematician. 
Abraham De Moivre. Within one hundred years. Carl Friedrich Gauss would proclaim mathematics, "the Queen of 
Sciences." and introduce the application of the "normal curve" to other disciplines. The chasm between measurement 
and language appeared to have been inadverdently and tenuously bridged through the Wundtian school, as well as 
through the work of the eugenicist. Sir Francis G jiton. Wilhelm Wundt's "relativity theory" formed a framework for his 
devoted followers to explain the psycholinguistic relationships between "Gliederung" and "Gesamtvorsfellung." It was 
Galton in his experiments, who had attempted to apply the theories of his cousin. Charles Darwin, in order to prove the 
inheritability of intelligence. Although ensconced in psychology, the important linkages between language and mea- 
surement seem to have become apparent, albeit grudgingly, to such modern day inheritors of "Galtonian" tradition as 
Arthur Jensen. 

Perhaps researchers will declare, within forthcoming decades, that the emergence of language assessment as an inde- 
pendent entity resulted in large part from developments in Civil Rights movements and a quest for social justice. Some 
modicum of evidence can be brought forth in support of this contention, when or^e views the historical rise of bilingual 
education and the plight of limited-English proficient populations in the United States. New terms would appear in 
educational vocabulary, such as language "dominance" and "proficiency." indicating additional frontiers for exploration 
and pardoxes for resolution. We can exemplify this last poirit by suggesting Roger Shuy's innovative concept of the 
"iceberg effect." Introduced in 1976. the concept posed the dilemma of reconciling statistical precision in quantifica- 
tion with true and complete representation of observable data. The closer one arrives to the tip of the "iceberg." the 
more statistically accurate are representations of surface structures (i.e.. phonology, vocabulary, and grammar). The 
further away we go below the suriface-line toward deep structures, the more accurate we are in representing semantic 
meaning. On one level, the precision/representation dilemma is couched in terms of discrete-point and global 
measures. On another level, the problem is inbicative of reconciling linguistics and psychometry in language 
assessment. It also suggests the cross-fertitization of language assessment by developments from a myriad of 
disciplines, which continue to increase respective realms of knowledge at a geometric rate. With the developments 
will arise new issues and paradoxes for resolution, as the alchemists of today give way to the innovators of tomorrow. 
For indeed, to fail to claim that one collection of ideas holds sway over one historical period would be tantamount to 
measuring one chronological era against the standards of another 

It was with the.thought of bringing current issues to the forefront, that the First Annual Language Assessment Institute 
was conducted at National College of Education, This volume was conceived as the first in a series to include deliveries 
by participants and invited papers from theoreticians and practitioners in the field. Sponsors of the Institute, which 
will be repeated annually, include the National Clearinghouse for Bilingual Education: Educational Testing Service: the 
Evaluation. Dissemination and Assessment Center (Dallas. Texas); the Bilingual Education Servjce Center (Illinois); the 
Illinois State Bilingual Office: the Illinois Resource Center; and National College of Education, Papers in this volume are 
grouped into three succinct categories: Foundation. Assessment Approaches. Research and Policy. 

Behind the scenes in the production of any volume of this nature, are the unsung heroes, who commit countless hours 
of toil and dedication. It is with grateful thanks and appreciation that acknowledgement is made to Jean Honeyman. 
Manager. Staff Support Services. Illinois State Board of Education, and to Lynn Rhoades. Typesetter. Staff Support 
Services, who so willingly gave so much time and expertise in the typesetting of this project. Bill Becker. Manager. Illi- 
nois State Board of Education. Internal Office Support and Printing Section, and Bernie Neff. Assistant Manager. Print- 
ing and Graphics were invaluable in providing much-needed material support and creative ideas. Special gratitude 
goes to Linda McElroy who spent many hours poring over the manuscripts and is deserving of the accolade, "an edi- 
tor's editor." Cornelia Powell and Steve Rothenberg added their much appreciated talent in helping design the cover 
and other graphics I would also like to express my indebtedness to Maria Medina Seidner for her professionalism and 
patient, loving support and to my infant daughter. Jacqueline Elinor, for prolonged and welcome moments of quiet in 
completing this volume. Finally, my appreciation goes to the contributors of this volume for their excellent presenta- 
tions and papers. 

Editor 

Dr Stanleys, Seidner 
Assistant Dean for 

Research and Development 
National College of Education 
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ZENO'S PARADOX AND LANGUAGE ASSESSMENT 



Rudolph C. Troike 
University of Illinois at Urbana-Champdign 



My fi^rst experience with language assessment was over 
twentV years and two linguistic paradigms agO; when I 
joined the Georgetown University English Language 
Program m Turkey and found myself involved in testing 
AID participants using the Foreign Service Institute in- 
terview approach, and testing Turkish students in En- 
glish classes using discrete point tests. In that dimly 
remembered and distant past, we were all confident 
positivists and knew all there was to know about Ian- * 
guage and language assessment — or at least how to 
find out what we didn't know. But as Robert Lado once 
stated, "the older you get the more you find you have to 
liva down." Or. as the Pennsylvania Dutch saying goes. 
"How come you're too soon old and too late smart?" 

Today we have reached the point — or at least some of 
us have — of true transcendent wisdom. We now know 
that what we thought we knew in the past was wrong, 
or incomplete, and we have begun to understand 
enough about language to realize how little we now 
know As we learn^more and more, we recognize we 
know less and less. Thus we are gradually approaching 
the achievement of the ultimate truth, which will be to 
knc?w everyth'tag about nothing and nothing about 
everything. 

One consolation in our present state of wisdom is that 
at least no one will try to fault us for being prematurely 
overconfident. Those who remember back to the heady 
days of yesteryear may regret one thing which was lost 
along with our youthful innocence — the sublime satis- 
faction of having all the answers whenever someone 
asked. 

Unfortunately, people are still asking questions and ex- 
pecting answers, and unfortunately now that we have 
achieved wisdom, wa no longer have such ready an- 
swers The price of wisdom is the awareness of igno- 
rance, which IS not nearly as satisfying as knowing 
everything. Indeed our inability or hesitation to answer 
questions can be very frustrating, both to questioners 
and to language assessment specialists alike. Of course, 
there are those who feel that when ignorance is bliss, 
'tis folly to be wise, and confidently stand ready to dis- 
pense quick answers when asked The dilemma is that 
there are massive national needs calling for solution, 
and school administrators and teachers who desperate- 
ly need answers. As specialists knowledgeable about 
the state of language assessment, we would be derelict 
in our responsibilities if we refused to respond or turned 
them away empty-handed. 



from the front to the back of a room. In doing so. you 
will progressively cover half of the distance, and then 
half of the remaining distance, and so on through an infi- 
nite series of successive approximations. In theory, no 
matter how long you go. you can never reach the final 
point. In reality, of course, we do in fact reach the point 
we set out for There is a moral here. I believe, for lan- 
guage assessment, for our pursuit of understanding is 
infinite, and we know we can. never arrive. Nevertheless, 
we have practical issues and problems in the real woHd 
to deal with, and we must do what we can in the interim 
with the tools and knowledge provided by our latest 
successive approximation. 

I should now like to turn to the subject of language as- 
sessment, and consider what it is we are about and 
why There is an interesting interaction between the 
two component elements of the name itself, which 
strongly affects what language assessment actually 
consists of. First, the conception of language which is 
held will partly determine what \s assessed and how h 
is done, and secondly, the view of appropriate assess- 
ment procedures will affect the choice of what fs mea- 
sured which for the purposes of assessment becomes a 
de facto definition of language, i.e.. language is what 
the assessment procedure/instrument measures. Both 
aspects in turn are strongly affected by the purposes of 
assessment. A definition of language or a specification 
of assessment scope or procedures cannot be specified 
in absolute terms independently of why an assessment 
is to be made. 

Several examples will serve as a caveat. Twenty years 
ago the prevailing linguistic paradigm viewed phonolo- 
gy as the most important aspect of language, grammar 
as next in importance, and vocabulary or meaning as 
least important. The transformational paradigm rejected 
this conception and placed grammar on top. only to be 
overturned by Fillmore's case grammar and other relat- 
ed models which gave primacy to meaning, with gram- 
mar next, and phonology least in importance — a com- 
plete inversion of the earlier "structuralist" view. 

Sociolinguistic and anthropological (not to mention psy- 
chological) perspectives have added additional compli- 
cations to the concept of language by removing it from 
*he antiseptic decontextualized ideal realm to real life 
settings, where domains, repertoires, skills, roles, sta- 
tuses, attitudes, settings, topics, knowledge, personality, 
and individual relationships interact to affect linguistic 
competence and performance in various situations. 



Which "bnngs us to Zeno's paradox, which I believe is 
relevant here because the Greeks were always interest- 
fid in truth I have chosen Zeno's paradox because I 
think ft appropriately symbolizes the situation in which 
we find ourselves In its simplest form, it deals with the 
problem of attempting to reach a goal - say. walking 

ERIC 



The older view of language as primarily oral caused 
reading and wntmg (as well as manual signing) to be ig- 
nored for testing, a senous bias which unfortunately 
continues to the present. A caveat that I would raise at 
this time IS that the bnght new hope of communicative 
competency testing may. depending on our purposes. 
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lead us still further astray. We must be careful that we 
do not find ourselves in Zeno's paradox in reverse, with 
each new step leading us halfway again away from our 
goal. 

As stated earlier what we assess and how we assess it 
is intimately associated with the purpose of assessment 
As Bernard Spolsky noted at the Language Proficiency 
Assessment Symposium in Warrenton. Virginia, perhaps 
the most efficient test on record was the single-item dis- 
crete pomt criterion-referenced entry-exit test men- 
tioned in the Book of Judges in the Old Testament: 
when the Ephraimites sought to cross the Jordan, the 
Gileadites asked them to pronounce the word for fish; 
since there was a %/s isogloss between the two tribal 
varieties of Hebrew, if they said sibboleth instead of 
shibboleth, they were deemed to have failed the test 
and instantly exited from existence. Test statistics state 
that 42.000 Ephraimites failed the test that day. 

At the other end of the spectrum, we have the recent 
proposal by Michael Canale of OlSE for an extensive 
and sociolinguistically sophisticated assessment of Ian- 
gauge proficiency, essentially 6y making a detailed in- 
ventory of communicative competence. But here again 
let me raise another caveat regarding purpose. We need 
to determine if what we are measuring bears a meaning- 
ful relation to the reasons for making the measurements. 
For those concerned with bilingual education and the 
school achievement' of children from non-English lan- 
guage backgrounds (NELB). I would like to propose the 
perhaps startling idea that we may be making a serious 
mistake m assessing language at all as a criterion for 
program exit 

Lest I be misunderstood, let me hasten to explain. When 
bilingual education was getting started in this country a 
little over a decade ago. many of us argued for its 
necessity because of the language barrier, an argument 
adopted by the Supreme Court in the Lau vs Nichols de- 
cision: if the child could not understand English, he or 
she was obviously excluded from effective participation 
in the classroom Just as obviously, acquisition of suffi- 
cient English to follow instruction seemed to remove 
the barrier, so that this became the touchstone for most 
state and federal legislation, and for implementation of 
the Lau decision itself I have previously suggested else- 
where that his argument may have been mistaken. Espe- 
cially since the work of Skutnabb-Kangas and Toukomaa 
on Finnish immigrants in Sweden. James Cummins and 
I have argued on somewhat separate grounds that Eng- 
lish language proficiency, as ordinanly conceived, is, 
. beyond a certain minimum level, only a small and per- 
haps largely irrelevant factor in the school achievement 
of NELB students To take a simple example: various 
Americar Indian tribes have given up their native lan- 
guages and have adopted English instead, but they 
have not improved their school achievement as might 
have been expected if language proficiency were the 
main factor inhibiting achievement. 

The Finnish researchers. Cummins, and John Oiler as 
well have argued in the direction that there are deeper 
^ 'cognitive skills which underly the use of languagei and 
;^|^^"vhich may develop through the use of language. Tfjie di- 
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rection of my own argument has been that extrinsic and 
intrinsic social and cultural factors (inclu^Iing such 
things as the Pygmalion effect of self-fulfilling teacher 
expectations) may be of overriding importance vis a vis 
school achievement, as evidenced by the fact that 
Blacks in Texas and parts of California fare worse in 
school than do Hispanics. In this view, while the use of 
the student's language as a medium in a bilingual pro- 
gram may indeed promote the development of cognitive 
abilities, its chief value may be symbolic, by helping to 
legitimize the positive valuation of the student's culture, 
social group, and self. Otherwise we would be at a loss 
to account for the success of middle class majority stu- 
dents in second-language immersion programs in 
Canada and elsewhere. ar»d the massive failure of 
minority language students in regular all-English (sub- 
mersion) programs. 

Oiler meanwhile has proposed that language ability per 
se is at the root of the results of I.G. measures, and that 
what such measures reflect is basically a general lan- 
guage proficiency factor. Muriel Saville-Troike. on the 
other hand, has suggested that certain types of success- 
ful language proficiency, such as in interpersonal com- 
munication skills, may for primarily social reasons have 
a negative effect on the development of proficiency in 
handling decontextualized academic prose, which is es- 
cential for academic achievement. 

The problem is obviously complex and will require 
much more research -o resolve. Whichever way one 
goes, it is clear, as Cummins has pointed out. that rather 
superficial measures of English fluency (with no atten- 
tion to competence in the native language) may be com- 
pletely misleading with regard to the level of language 
control needed for more abstract decontextualized 
cognitive functions. Cummins has reported on children 
who sound like fluent native English speakers but who 
are yet having severe problems in school because of 
their limited ability to manipulate the language at a 
level necessary to successfully perform academic-type 
tasks. He suggests that it may take up to five years to ac- 
quire that ability, an observation that fits well with re- 
sults from several bilingual programs which show stu- 
dents reaching national norms in their fifth and sixth 
years. 

The difficulty we find ourselves in with regard to lan- 
guage assessment is that our original rationale for 
bilingual education is coming back to haunt us. Having 
successfully argued for bilingual education as a means 
to overcome the language barrier we not sufirisingly 
see English proficiency defined as the sole criterion for 
program entry and exit, even though we now realize 
that this may be a third or fourth-order factor or even 
completely misleading. Nevertheless, the parameters 
for language assessment are being determined primarily 
by legislators and administrators, rather than by special- 
ists in bilingual education or language assessment, and 
we face the danger that people will uncritically accept 
and respond to these parameters. The statement may 
sound heretical coming from a linguist, but I believe we 
may be making a mistake in attempting to assess lan- 
guage at all. The purpose of bilingual education should 
be educational, not just the development of English pro- 
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ficiency. Consequently, language assessment may very 
well be an inappropriate basis for educational decision 
making for limited-English proficient students. 



Since the denriands for language assessment are there* 
mandated by law and court decisions* what can we do? 
We can* as many already do* try to take other factors 
such as achievement scores into account* and avoid* if 
possible* using a language score as the sole criterion for 
program exit. Meanwhile* we need additional research, 
of course* but more importantly* we need to vyork to 
convirKie legislators* administrators* andiDthers that ac- 
ademic achievement rather than English proficiency* 
should be the basis for exiting from a bilingual program. 
Better than that* we should seek to remove the com- 
pensatory label from bilingual education and work to in- 
stitutionalize it as a permanent program option which is 
available to all students and in which requirements for 
program exit would not exist. 
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SOCIOLINGUISTIC FOUNDATIONS OF LANGUAGE ASSESSMENT 



by 

Jacob L. Ornstein-Galicia 
Department of Linguistics 
University of Texas 
El Paso, Texas 



Prefatory Note 

Does sociolinguistlcs have anything to offer the art 
of language assessment? That quite simply is the 
issue that I intend to address in this paper It is char- 
acteristic of Americans, and indeed It is an an ad- 
mirable feature, that they are ever interested in a 
new body of knowledge which will, hopefully bring 
great breakthroughs and which will advance us 
somewhat further to a truly Golden Age. I wish that 
I could sit here today and promise all of you interest- 
ed in bilingual education and in assessment, that 
low and behold the content of sociolinguistlcs is 
the kind of study that will solve ail your assessment 
problems and mine. Unfortunately, paraphrasing 
Churchhill of World War II. I can offer you little but 
blood, sweat — and at the worst — eyestrain from 
perusing the voluminous literature that has accu- 
mulated In sociolinguistlcs (hence SOL) both here 
and abroad. 

In this connection, it is heartening that the propo- 
nents of SOL have not fallen into the same error as 
those in succession, of Bloomfieldian structural 
li5iguistics. then Chomskian transformational gram- 
mar, then mathematical linguistics. So great is the 
chansma of psychology that it was likewise 
thought that the marriage of linguistics and the 
former discipline could not but lead to unprecedent- 
ed breakthroughs providing answers to all the puz- 
zles of applied and theoretical linguistics. With all 
due respect to the progress made in all these sub- 
fields, we now realize that there »s no quick way to 
find solutions in the language field. I'm afraid that Is 
also the plight of bilingual assessment and lan- 
guage proficiency testing in general, which does 
not mean that we can stop trying. 

After such a pessimistic side-tnp. I must return to 
my topic and sound a more optimistic note. Yes. I 
do feel that SOL has a great deal to offer and will at- 
tempt to so demonstrate. In the forthcoming sec- 
tion, I will trace the development of SOL. followed 
by discussions of possible relationships to language 
assessment (hense. LA). If all that is seen in the fol- 
lowing pages is not precious metal, hopefully it will 
not be fool's gold. 

Development of Sociolinguistlcs (SOL) 

Sociolinguistlcs has always existed, to the extent 
that those who described languages also included 
information on the location and characteristics of 
its speakers, as well as m what contexts they em- 
ployed the languages of a community or a nation In 
'ii'iying this, we h.we really furnished a simplified 



definition of this study which attempts to combine 
both the linguistic and the social dimension. In 
more technical terms, we might also describe it as a 
study which addresses itself to language behavior 
and its societal concomitant. 

Following World War II and America's much- 
broadened geopolitical position, it began to be real- 
ized that our parochial and myopic stance toward 
language and area studies was out-dated. These 
studies blossomed up to the late 1960's when the 
revolt against school requirements of any sort 
almost obliterated them. At any rate, linguistic 
scientists were also dissatisfied with the narrow 
perspective of language study and analysis, and 
decided to join hands with social scientists, espe- 
cially sociolinguistlcs. with or without hyphen. Ac- 
tually the term had first been used in 1 946 by Haver 
and Eva Cuhe. the former a social scientist, the 
second a speech teacher (Curie and Cune. 1952). 
Scholars from linguistics and the social sciences, 
particularly sociology, organized two meetings, 
both occurring in 1964 with the aid of the Social 
Science Research Council, and taking place at 
UCLA (Bright. 1966) and at Indiana University (Lie- 
berson. 1967). No sub-branch of linguistics or 
sociology has shown such phenomenal growth in 
the last two decades as regards both college teach- 
ing and research, and apparently today it is stronger 
than ever 

It is. however, erroneous to give the impression that 
SOL was "invented" by the two groups of American 
scholars convening at Los Angeles and Blooming- 
ton. Many Europeans continue to be amused at. if 
not irritated by. what they consider Amencan "fad- 
dishness" and presumption, as it were, of discover- 
ing all that is modern and good in linguistic science. 
Indeed, it has been a general tendency of both Brit- 
ish and European schools of linguistics that they 
have traditionally incorporated much more extra- 
linguistic information than has been the case here. 
By way of example, in Great Britain. Firth, under the 
influence of anthropologist Malinowski. accorded 
the cultural component a significant role in his theo- 
ries. Halliday a contemporary Briton, has been 
greatly concerned with the compartmentalized 
roles that language fulfills in society as he devel- 
oped his version of functional grammar. Basil Bern- 
stem (1 971) at the University of London. Institute of 
Education, onginally basing himself on research car- 
ried out on middle- and lower-class 10-12-year-old 
schoolboys, developed his own model of SOL 
which, in bnef. holds that the speech of working 
class youngsters consists mostly of juxtaposed 
simple sentences representing a restrictetl code." 
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By contrast Bernstein claims that middle-class 
pupils produced an "elaborate code," marked by 
complex sentences showing embedding, subordina- 
tion, and the tike. Bernstein's theory in its varied ver- 
sions has exercised a powerful influence among 
those working on intervention programs, but there 
is continuing controversy over his views as "elitist." 

In Europe >r would be impossible to enumerate 
those wJro have embodied sociocultural concern in 
their ^proaches to language science. The Prague 
School before World War II was careful to regard 
language in a social context, a trend which present- 
day practitioners such as Jakobson and Garvin 
have continued in the United States. In France, as 
early as 1956. Marcel Cohen had published his Pour 
une sociologie du language (For a Sociology of Lan- 
guage), but his clarion call was not heeded. Mean- 
while, in the United States, there were scholars like 
Einar Haugen. author of Bihngualism in the Ameri- 
cas (1956). a guide considered by many unsur- 
passed in its conciseness, as well as the Norwegian 
Language in America (1958 and 1969). Both these 
works are unquestionably sociolingutsitic in scope 
and purpose, while the latter remains to this day 
one of the 'best, if not the b?str descriptions of an 
ethnic dialect m America. Finally, the late Uriel 
Weinreich's monumental Languages in Contact 
(1953). again addressed the social context through- 
out its rich material not only on the Romantsch of 
Switzerland, m contact with German. French and 
Italian, but also in the other numerous contact situa- 
t»on.s described by him Ironically, however, the 
'wprks of Amencan sociologist. Joyce Hertzler 
including both his article "Toward a Sociology of 
Language" (1953) and his A Sociology of Language 
(1964) failed to create a following despite his sound 
ideas that both sociology and linguistics ought to^ 
join forces, for they are both quintessentially gene- 
ralizing and categorizing disciplines, more than 
almost any others. All this is by way of avoiding an 
over-simphstic picture of the development of SOL. 

It would be difficult to understand the foundations 
of American SOL without referring to the status of 
linguistic science up to and at the time of its explicit 
development, beginning mostly in the 1960's. Dis- 
satisfied with what they considered the static 
nature of philology, with its predilection for wntten 
texts, a small group of scholars around Leonard 
Bloomfield at Yale, then Chicago, founded in 1925 
the Linguistic Society of America, determined to 
make "linguistics" a S'^ience no less rigorous than 
the natural sciences and devoted to spoken lan- 
guage Amencan structural linguistics as it came to 
be called, was concerned with the stiuctures of 
sound and grammar, described in a special frame- 
work and with symbols intended to bring uniformity 
to linguistic discussions A number of exotic lan- 
gu'iqf^;s w^re investigated through native speakers. 
inclurlir«g many Amerindian languages, some uf 
them known only to a few survivors When Pearl 
Harbor occurred, it was natural that the Armed Ser- 
viCH*-^ turned {u the "new linguists" to set up pro- 
(jrarnri \\\ ovfu 40 languages, known as the ASTP 



(Army Specialized Training Program), for some of 
the 11 million service personnel dispatched 
overseas. 

Bloomfleld's structuralism reigned supreme for two 
decades following 1945, the war's end. Although it 
brought a breath of fresh air and many innovations 
still valid today. Bloomfield's scientific aspirations 
created a bugaboo in "mentalism" or the so-called 
attempt to apply psychological suppositions, or 
extra-linguistic phenomena in interpreting linguistic 
ones. This was fully challenged by Noam Chomsky 
in his Syntactic Structures (1957) which became 
the manifesto of generative-transformational gram- 
mar or "TG," a theory which in its turn is questioned 
today, but which is still very powerful. TG at first 
turned its back on the phonological fixation, as it 
viewed it. and took a dim view of "phonemics." pre- 
ferring to concern itself with syntax or grammar 
Over-simplifying. Chomsky considered language at 
several levels, and not in the form of linear strings 
or symbols with meaning or sounds with meaning. 
At the base was "deep structure." roughly the 
semantic component or the message. A series of 
transformations expressed in a medium borrowing 
from symbolic logic mediated between the deep 
and the "surface" structure, the latter being what 
the speakers say or write. Language is. then, a two- 
way process in which messages are first encoded, 
then decoded. Chomsky has upon occasion stated 
that linguistics is. or ought to be. a branch of cogni- 
tive psychology, a stance which is anathema to 
what might be called "anthropological linguists." 
those following somewhat m the descriptive tradi- 
tion of Boas. Sapir. Bloomfield. and a group of bril- 
liant anthropologists aggressively concerned with 
language. 

With early or vintage Chon.sky at least, the social 
context of language. fc3r diffenng reasons, fared 
even more poorly than with Bloomfield. because 
structural linguists could at least take refuge in an- 
thropological linguistics and thus not be accused of 
the error of mentalism. In his turn. Chomsky wrote 
in Aspects of Language and elsewhere, that the 
business of linguistics was to study the speech of 
an "ideal speaker-listener" in a completely homoge- 
neous speech community who knows the language 
perfectly" {Aspects of Syntax, 1965:3). Although 
fortunately, his contemporaries have chosen to 
ignore or take this rather lightly, it is clear that SOL 
could not thrive in either climate and to a large 
extent owes its existence to these constraints on 
the purview of linguistics. William Labov. largely 
working in the Chomskian framework, has suggest- 
ed more than once that if the social context were 
adequately taken into consideration by linguists, 
there would be no need for a sub-di3cipline such as 
sociolmguistics 

MeanwhilH during all t iis time, where m tne world 
was dialectology? Witn few exceptions, it still fol- 
lowed the Ifjsurely Sprachat/as approach of 'he Eu- 
ropean masters in a day when a leisurely race al- 
lowed ultra-lengthy questionnaires ar.U ehcitatiojn. 

'jj 



when interest was focused mainly on regional dia- 
lects, and what has been termed the "antiquarian" 
preoccupation aimed at identifying varying terms 
for everyday objects and terms applying to agricul- 
tural and household equipment Nevertheless, it 
would be an injustice to state that Hans Kurath, the 
"father" of American dialectology, in adapting Euro- 
pean methods to the U.S. regional atlases did not 
improve upon the patterns, cutting out some of the 
antiquanan concern and stratifying informants into 
th''ee categories according to socioeconomic 
status, age and education. It was also at the Uni- 
versity of Michigan, where Kurath held sway, that 
leading investigators of language diversity, like 
Raven McDavid anct^oger Shuy to name a few. ac- 
quired their basic training. The uniform procedure 
used by the principal investigators of the regional 
atlases was a decided plus in avoiding general 
chaos. Nevertheless, a goodly number of dialect 
geographers joined the movement to explore and 
describe social dialects, while others chose to con- 
tinue with revisions and modernizations of methods 
(cf. Allen and Underwood. 1971). 

Social history in the United States was. however, 
ahead of language scholars. As long as the Melting 
Pot concept had remained a firm foundation of edu- 
cation, immigrants and minorities needed only to 
become Americanized, getting rid of foreign cus- 
toms, languages, and. if possible, telltale foreign ac- 
cents. When, however, social forces turned this 
around, rejecting the Melting Pot. and proclaiming 
in the Civil Rights Act of 1 964 that our goal was in- 
stead "cultural pluralism" within our republic, a host 
of movements for ethnic and cultural recognition 
and rejection of enforced assimilation brought new 
demands and pressures upon our educational and 
scholarly establishment. These were reinforced by 
the passage of a great deal of landmark legislation, 
including various "titles" for the remedy of educa- 
tional disadvantagedness. and particularly Title VII 
of the Elementary andjBecondary Education Act 
(ESEA). popularly known as the Bilingual Education 
Act in 1967. Of utmost importance was the Lou vs. 
Nichols Supreme Court decision in 1 972. mandating 
bilingual education as a bridge toward the acquisi- 
tion of English. The burden was then placed on the 
school system and related entities. Over twenty 
states had to repeal existing legislation forbidding 
any language of instruction but English. In all this, 
the tenure of President Lyndon Baines Johnson 
may be considered the watershed of contemporary 
socioeducational legislation aimed at remedying 
long-existing social inequities. 

All these developments brought the study of "social 
dialects" into the center of attention, also creating 
an unprecedented demand for more data and in- 
sights into the language patterns of the impacted 
populations of the inner cities, the rural poor. and. 
above all. the ethnic minonties There arose urgent 
needs in ethnolinguistic data on at least 50 of 
America's more than 100 ethnic groups Often it 
was easier to find such information on the lan- 
O „ quaqes and dialects of the Somoas. New Guinea. 
EKJC the Dravidians or southern India 



As the dialects of these minorities had largely been 
brushed under the carpet, few scholars and teach- 
ers had dared to devote themselves to such margi- 
nal fields. It was the sociolinguists who eagerly 
stepped into the breach so that the 1960's and 
1970's witnessed a rich number of studies on non* 
standard vaneties of speech, particularly Black Eng- 
lish and and varieties of U.S. Spanish. There is little 
question that the funding by government agencies 
made possible strides in SOL which the shrinking 
budgets of academia would not have supported. In- 
dividuals of SOL orientation are commonly consul- 
tants on the numerous bilingual projects. Head 
Start. Project Bravo and other intervention efforts 
supported by federal and state agencies. This sec- 
tion will be devoted to describing briefly the vanous 
models for the investigation of linguistic diversity 
which is the quintessence of SOL. 

The study of SOL has been frequently attacked tor 
really having no parameters and trying to be all 
things to all men. There is truth in some of this, but 
if SOL had not been developed, something like it 
would surely have taken its place. This branch or 
sub-branch of linguistics and of sociology coexists 
with a number of other fields, particularly anthropo! 
ogy. ethnology, psychology, neurology, speech 
communication, and others. A certain segment of 
formal linguists are indeed hostile to it, feeling that 
sociolinguistics is too watered down to be consid- 
ered a serious science, while still others consider 
SOL as one of the recent models of linguistics itself. 

Is SOL itself cultivated in a uniform manner? By no 
means, since its basic requirement is only that both 
social and linguistic factors be considered. At its 
best, it tries to establish, when possible, interrela- 
tions between both dimensions. A very useful 
taxonomy has been provided by Walt Wolfram (per- 
sonal communication). He divides the field into 
three general patterns, as fOflows: (a) the an- 
thropological model, (b) the linguistic model, and 
(c) an intermediate model falling between the two. 
Regarding the first model, its most distinguished 
exponents are Dell Hymes and John Gumprez. 
Hymes argued that if linguists would only broaden 
their concept of the systems of language to account 
for the functional varieties of speech, there would 
be no need for any new name for his proposed 
study — "linguistics" alone would do. Within five 
years, in "Models of the Interaction of Language 
and Social Setting" f1972). he attempted a some- 
what more explicit blueprint for his model. "Speech 
situations." "speech events" and "speech acts" 
should be differentiated and just as explicitly ana- 
lyzed and described in terms of rules as is the case 
with linguistic phenomena. The total "speech 
economy" should become fair, and indeed, manda- 
tory game for linguistic research. In a clever 
mnemonic device. SPEAKING, he included some of 
the primary components of any speech event, 
proposing that these be incorporated somehow 
within the framework of generative grammars 
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It is. to a large extent, as a catalyst that Hymes exer- 
cises a continuing influence upon this new, "more' 
aware." linguistics. Through his prolific writings 
and his lecturing, he is known throughout the 
world. Nevertheless, as he himself has admitted 
(personal communication. 1971). the main short- 
coming of his approach ~ which he is reluctant to 
term a true model — is its relative lack of use in 
actual field situations. An updated version of the 
Models paper also appears in the volume. Direc- 
tions in Sociolinguistics (Gumperz and Hymes. 
1972). Hymes edits the journal. Language in Socie- 
ty, published at Cambridge University, providing 
sociolinguists the world over with a much needed 
and respected forum. 

Turning now to the linguistic model, we find a well- 
known author. William Labov. who entered sociol- 
inguistics through the field of language theory. 
While a student of the late Uriel Weinreich at 
Columbia University, he developed an interest in 
problems of linguistic variation and change, forming 
the hypothesis that, despite the long-held neo- 
grammarian view, change could as effectively be 
viewed in progress (synchronically) as in retrospect 
(diachronically). 

Field work on the varieties of English on Martha's 
Vineyard, an island off Massachusetts (1963). and 
among the ethnically and socially diverse inhabit- 
ants of New York City (1966). enabled him to devel- 
op a methodology to identify and irsasure language 
change as it was occurring. To many, if not most 
persons in the field, his "Social Stratification of En- 
glish in New York City." with its convincing presen- 
tation of the concept of the "linguistic variable." 
stands as the fundamental guide to the contempo- 
rary research on linguistic variation and change. 

Thoroughly familiar with the principles of modern 
general lirjuistics. Labdv has become increasingly 
dissatisfied with a theoretical orientation that as- 
sumes excessive linguistic homogeneity in the 
speech patterns of any givein community. Rather 
than assume an "ideal speake^-h^arer'^or generalize 
to the community on the bapis of working with a 
small number of informants. Labov assumes lan- 
guage is inherently variable and that the only viable 
means to measure its use is to measure it in use. 
Further, he expects that a great amount of that 
variability which previously was dismissed as "free 
variation," in fact adheres to definite patterns deter- 
mined by the linguistic environment on the one 
hand and the social environment on the other. 

One of Labov's earliest theoretical contributions 
was the isolation of the linguistic variable. From the 
study of Martha's Vineyard on. this has simply been 
a high-frequency variant (in most of his studies, 
phonological) which turns up with such frequency 
m all styles of everyone's speech that, for practical 
purposes, it cannot effectively be suppressed. It 
constitutes sociolinguistic data. When correlated 
with such demographic factors age. sex. and 
socioeconomic status and then further correlated 



with factors relating to style and discourse, interest 
4fig_patterns of usage emerge for the speech 
community. * , 

The variable rule first appeared in the exhaustive 
discussion of the findings of Labov. et al. (1968). It 
constitutes a device to relate the observed patterns 
of usage of linguistic variables to the format of writ- 
ing actual grammars and has far-reaching implica- 
tions for general linguistic theory. These rules are in- 
tended to supplement the bulk of the rules, mostly 
invariant, in a grammar, specifying the effects of 
the total environment — linguistic and sociolinguis- 
tic — bn the output of the grammar. 

Labov and his associates found that New Yorkers 
realize /dh/ and /th/ under certain conditions not as 
continuants, but as the highly stigmatizing stops, 
/d/ and A/, respectively. He would formulate the ap- 
propriate variable rule like this. 



-f cons 



- voc 



- strid 

- back 

4- cor 



This rule departs from the standard Chomsky-Halle 
format only in that the additional convention of the 
parentheses around the feature to the right of the 
arrow indicates variability. So that they might be ap- 
plied properly, the grammar must be supplied with 
an index of variability. 0== 1 " for each rule so 
marked. Assuming that a rule actually were "cate- 
gorical." that is. applied universally with no effective 
constraints, the value of k (which indicates the 
constraints) would be zero. However, this would not 
be the case with these rules, and degree of variabili- 
ty would be expressed as a function of the con- 
straints of both style and socioeconomic class, as in 
the following formula: 



k -f(SEC. Style) 
o 



= a(SEC)4-b(Style)4-c 



The precise values inserted into the formula for 
each linguistic variable would result from meticu- . 
lous observation of actual speakers from the speech 
community in question as registered in real 
situations. 

A number of younger and mid-career scholars have 
chosen to align themselves with the Labovian 
model. What we might call the "Washington 
school" is comprised of Roger Shuy. Walt Wolfram. 
Ralph Fasold. Joan Baratz (now in New York City). 
Orlando Taylor. David Sankoff and Henrietta 
Cedergren (1974) (in Canada) and others who have 
been developing some modification of Labgv's 
model, particularly in the direction of "probabilistic" 
rather than "frequency" approach, with extensive 
application of computers and advanced mathemati- 
cal concepts. Sankoff is a professional mathemati- 
cian. In Los Angeles. Benji Wald has worked on Chi- 
eano Spanish and English in the eastern part of the 
city, while Shana Poplack has concerned herself 
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with Puerto Rican Spanish in New York City and 
John Baugh at the University of Texas-Austin has 
addressed himself to language varieties of the 
Southwest, including Black English. In large part 
due to the diligence and dedication of these "disci- 
ples" of William Labov. Black English has been ex- 
tremely weli-covered sociolinguistically. probably 
more than any language variety. At the same time, 
it is saie to assume that most of this group would 
hardly be flattered to be identified exclusively with 
only one line of research. Space forbids discussion 
of other interests. Finally, an able sythesizer of soci- 
oiinguistic work along the Labovian model, as well 
as a researcher in his own right on Brazilian Portu- 
guese syntax, is Anthony Naro (1981) at the Univer- 
sidade de Campinhas. 

fy/luch like Labov. Gumperz and Hymes. Joshua Fish- 
man has become extremely well-known through 
his copious writings. Of utmost importance, both 
for the d^ita secured and the research design, is the 
Bifingualism in the Barrio project, performed by 
Fishman and a number of associates (Fishman. 
Cooper and Ma. 1972). This seems to be the most 
comprehensive study made of the bilingualism of 
any group In the United States, or perhaps the 
world. Fishman. trained as a social psychologist, 
makes use of a very broad battery of techniques 
from both linguistics and the social sciences. The 
study focused on 431 Spanish-English speaking 
Puerto Rican subjects from 90 families on three 
adjacent blocks of Jersey City. They were mostly 
young unskilled workers with little formal education 
and from the lower socioeconomic status. For con- 
trast, stratified samples were also taken of intel- 
lectuals, high school students and other groups 
from the Puerto Rican community ranging into New 
York City Extensive ethnographic information was 
gathered on the subjects, including demographic 
data and details on life-style orientation. 

The elicitation battery was extremely broad, includ- 
ing self-report on language fluency, language loyal- 
ty and the like. Open-ended interviews supplement- 
ed this data, particularly for prechse details on the 
relative use of English vs. Spanish in Fishman's five 
hypothesized domains of home, neighborhood, 
church, school and work. Use was made of a small 
inventory of linguistic variables, along Labovian 
lines, m both SfSanish and English and based partial- 
ly on Tomas Navarro Tomas's studies of Puerto 
Rican Spanish dialectology. 

The language output elicited from each subject was 
rated by independent judges along the following 
dimensions: 

1. Accented speech, with a seven-point scale in 
which the highest scores denoted Spanish 
dominance, the lowest. English and a medium 
score assigned to "balanced bilinguals"; 

2. Reading aloud of selected passages, similarly 
rated; 



3. Writing, based on self-report by one household 
member on a three-point scale, subtracting En- 
glish scores from Spanish, with positive scores 
indicating Spanish dominance, negative. 
English: 

4. Spanish repertoire range with respondents, 
globally rated on a four-point scale, the top 
figure indicating control of a number of styles. 
These styles were defined along Labovian lines 
and consisted of word list reading, passage 
reading, careful speech and casual speech; 

5. English repertoire range, identical with the 
above, but on a five-point scale. 

Attitudinal testing was modeled on Lambert's tech- 
niques, and subjects were asked to make value 
judgments, along a polarized Osgood semantic 
differential-type scale, of the personal characteris- 
tics of speakers heard on taped samples. One inno- 
vation, moreover, was the testing in which similar 
procedures were employed to evaluate the con- 
textual appropriateness of the English. Spanish, or 
code-switched variety of speech on taped samples. 
The significance here is that these researchers dis- 
played a departure from traditional unrealistic ten- 
dencies to regard bilingualism as the opposition of 
two standard languages, rather than as several 
coexisting varieties within languages. 

Elaborate use of such statistical techniques as 
factor analysis, regressive analysis and others per- 
mitted evaluation of a large number of correlations 
among the numerous socio-educational and linguis- 
tic variables represented in the study. 

Initially it should be pointed out that Andrew 
Cohen's Redwood City study is one of the few socio- 
linguistic studies on a Mexican-American popula- 
tion (1 970). Subjects were twenty low-income first 
and second grade Spanish speakers and xen middle- 
income English speakers of the pilot bilingual edu- 
cation project in Redwood City. California. Research 
design represents a combination of techniques 
adopted from the Fishman Puerto Rican project 
with guidance from team member Robert L. Cooper 
of that earlier project and suggestions from Wallace 
Lambert's well-known attitudinal investigations. 
The elicitation battery consists of five instruments 
administered by bilingual interviewers to individual 
pupils and their parents, respectively. These include 
demographic items, attitudes toward the languages 
and cultures, word-association tests for Fishman's 
five domains, and an ethnic image test based on the 
Osgood-Tannenbaum model. 

Cohen's linguistic observations are mostly im- 
pressionistic, but valuable in their insistence that 
deviant forms may well be socially significant in 
themselves as expressions of their users' felt cultur- 
al identity and should not be automatically assigned 
to contrastive standard language differences or the 
"culture of poverty." A further distinctive feature of 



the study is the parallel set of questionnaires for 
parents, instead of relying solely on children as 
subjects. 

Today the field of linguistics has been described as 
one in which so much reappraisal is going on that it 
is in a state of utter chaos. It is perhaps because of 
this ferment that so many look to SOL. whatever be 
the prototype, in order to save their sanity. Allen 
Grimshaw. a sociologist, has written of SOL as a 
hybrid discipline with a short and largely atheoreti- 
cal history (1971: 135). but that was years ago. 
Nevertheless. SOL has one redeeming feature and 
that is its commitment to field study of different 
populations. It is based not on speculation, biit the 
elicitation and analysis of data. 

C . Toward a Socioling u istic Orientation 

We have- sketched the trends in American linguis- 
tics, then moved on ta the development of SOL. 
going from there to research on dialectology, 
formerly concerned almost exclusively with region- 
al dialects, but which has by now added a new 
dimension — that of social dialects. At any rate. 
SOL has shown a very hardy growth, so that it 
covers not only social dialects but also language 
policy and planning, reading problems, literacy, and 
speech communication. At many points, it is some- 
times difficult to demarcate where one study 
begins and the other ends, but I would consider this 
a plus rather than a minus. Just as national deci- 
sions are said to be too important to be left exclu- 
sively to the military, SOL problems are far too sig- 
nificant to be entrusted exclusively to narrow-gauge 
linguists. With over 5,000 languages and countless 
dialects on Earch. yet only some 250 sovereign na- 
tions to house them, our agenda of problems of lan- 
guage and dialect In today's society contains all too 
many priority items. 

It should be emphasized, moreover, that more than 
being a sanctified body of facts and principles. SOL 
is at Its best an enlightened orientation, that Is. a 
general attitude or approach that attempts to view 
each language problem in the light of its own socio- 
cultural setting. 

Unfortunately, the very speed with which the 
mushroom-like growth of the bilingual education 
movement has moved has acted to prevent the ap- 
plication of SOL where it could be helpful. I.e. in 
general bilingual/bicultural education operations 
and especially In assessment. The world term "socl- 
olinguistlc" or "soclocultural" are bandied about a 
great deal, but very often that is as far as it goes. 
SOL has become a "must" when one speaks about 
bilingual/bicultural schooling, much more than a 
concept that is applied to any large extent. What 
passes for SOL In many teacher training programs 
IS merely warmed-over course material on culture 
and at its worst might be termed the "taco trail" or 
Mexican hat dance approach, providing pleasant 
entertainment through dances, songs, and informa- 
tion on exotic customs. Something more preferred 



is obviously needed, which will walk a tight-rope 
between engaging in propaganda or partisan ad- • 
vocacy for other cultures and systems and describ- 
ing variations in them, avoiding dangerous generali- 
zations which do not take into account differences 
on ethnicity, socioeconomic status, and other fac- 
tors in a given community.'region. or nation. 

Addressing assessment for a moment or so. I would 
say that this is virtually the most difficult operation 
expected of bilingual ists and school personnel, 
since it consists, by at least conservative estimates, 
of so very many variables — a couple of dozen at 
the most up to a hundred or more. Accordingly, an 
" instrument that would address all the variables 
would become so complex that it would be 
counterproductive. 

At the 1967 International Conference on the De- 
scription and Measurement of Bilingualism at 
Moncton, Canada, several distinguished colleagues 
addressed themselves to this. E. C. Malherbe. Union 
of South Africa, declared: 

It is doubtful whether bilingualism per se can 
be measured apart from the situation in which 
it is to function, in the social context in which a 
particular individual operates linguistically. The 
only practical line of approach to this com- 
plicated problem which I can suggest is to 
assess "bilingualism" in terms of certain social 
and occupational demands of a practical 
nature in a particular society. Here again the cri- 
terion is to be "bilingualism for what." Purpose 
and function is the only determinant (Malherbe. 
1967:50). 

Thorny technical obstacles in the way of assess- 
ment were also pointed out by Bertha Siertsema, a 
Dutch linguist. She even went so far as to state: 

As long fab the different intonational features 
can alter the entire meaning of one and the 
same "phatic" utterance, can even change it 
into its opposite as in irony or sarcasm, what 
' sense is there in measuring phatic material 
only (supposing all the time that this would 
indeed be possible)? Had not we better drop 
the whole idea of "measuring" in the study of 
bilingualism? (Siertsema, 1967: 155) 

It is time now to present a basic stock of SOL foun- 
dations which are pretty generally accepted in the 
field. It is not felt necessary to indicate origin or 
attribution of the concepts, since they are well- 
known in the literature. 

Cluster A: Language Is Flexible and Varied 

1. Language is not static, but dynamic, ever 
changing 

2. Variation in language Is much more common 
than homogeneity (cf. Labov). 



. 3. More of the earth's dwellers Inhabit regions of 
language diversity where there is more than 
one language or dialect than those that live in 
fairly homogeneous regions, such as Japan. 
Uruguay. Iceland: 

4. When languages are in contact, they usually in* 
fluence each other's vocabulary features. In ex- 
treme cases, they may become a mischsprache 
or mixed variety which h^s been common in 
most immigrant languages spoken in the 
United States. In most cases, at least, there will 
be code-switching, often at the subconscious 
levek 

5. All languac[j5S. as far as we know, have a 
number of styles, alternating in their degree of 
formality vs. informality. One may think of Ian- 

^ 9uage varieties, a term which we prefer to dia- 
lects because of the pejorative connotation of 
the latter term. 

implications: In Cluster A. we encounter a fairly rela- 
tivistic view of language, rather than an elitist one. 
which regards language a$ a set of sacred symbols 
which must not be defiled and which must be 
guarded in its pristine form. As in the world of 
nature, language changes as a result of language 
contact arjd other forces, such as deliberate mea- 
sures by governmental bodies, or even for reasons 
difficult or impossible to fathom — sounds and fea- 
tures change in certain directions. 

Teaching and testing along "puristic" or prescriptive 
lines tend to aid and abet the already widespread 
tendency toward ethnocentrism and regarding the 
variety of speech spoken by some elite as inherently 
superior to all otners. 

Cluster B: Language in Society 

1. Language occurs in a societal context and not 
in an abstract vacuum. 

2. Language is in itself neutral, but values and 
qualities-BTB^signed to it by people. 

3. Language generally follows the stratification of 
small, specific groups (small groups) of society 
so that the way we speak and write constitute 
immediate markers by which people judge who 
we are and from whenpiS we come, both 
geographically and socially! r 

4. Language can be studied.^as Fishman suggests, 
both as "macro-sociolingoistics." or as the role 
of language among grc/ups and organizations 
including regions ar/d nations concerned 
mostly with the speech circumstances of the 
speech behavior o^ individuals. TNs gives 
added perspectives/and a fuller range of views 
of the different role^ that languages play at vari- 
ous levels. 



5. A specific language viewed sociolinguistically 
is more easily seen as a linguistic continuum, 
with informal, nonstandard features at one ex- 
treme and with formal ones clustering more 
toward the other polarity. Thus, in the middle, 
we may posit an everyday style, containing 
both informal and formal elements. 

Informal. Neutral Formal. 

Lower Prestige ^ ^ Higher Prestige 

Features Everyday Style Features 

Bilingual communities may have different languages at each point. 

Implications: Both teaching and assessment are ren- 
dered more complex in view of the continuum. As 
Labov says, there are presumably no one-style 
speakers anywhere in the world. Teaching and as- 
sessment need to be constantly guarded against 
the excessive artificiality marking so much instruc- 
tion and testing. The answer is. of course, not in the 
direction of "anything goes." but rather in that of 
exposing the students to variation through their 
specific samples, which they may in turn learn to 
"mark." both as a class or in a society-testing 
heuristic. 

Cluster C: Domains of Activity 

1. Language may be regarded from many view- 
points, such as in prose and poetry studied as 
literature, or sociolinguistically from the stands 
point of the domains of living in which, let us 
say. each of a language pair or set ar^ utilized. 

2. Sociolinguistics is particularly interested in 
which domains specific languages or language 
varieties are utilized. Very commonly, a lan- 
guage of lesser prestige may be used in infor- 
mal domains, such as the home or the neighbor- 
hood, while the dominant or prestigious lan- 
guage may be employed in church, school, and 
the work place. 

3. Social changes and new alignments change 
from colonial status to independence, which 
may completely alter the above relationships. 

4. A language or language variety may. therefore, 
be made to serve whatever function for which 
it may be needed." 

5. At the same time, language planning may be 
necessary before a language used only for infor- 
mal purposes can be equipped to function in 
formal domains. Many states often have to take 
specific measures to provide new and special 
technical vocabulary in line with needs of 
modern government and technology. 

Implications: The above further extends the picture 
of language. This broadens the scope of one's 
knowledge of places where language is not standar- 
dized and has been in literate use for many years. 
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Languages which are only used for informal, every- 
day spoken purposes are called vernaculars. To a 
large extent. Spanish in the United States South- 
west has a vernacular role, and English serves for 
formal, technical purposes. 

Cluster D: The Variables of Performance 

1. Age. sex. language, ethnicity, and socioe- 
conomic class are some of the main variables 
influencing performance in a given language. 

2. Situational factors are another sort of variable. 
In general, sociolinguists attempt to have this 
occur in as natural a place as possible. 

3. Psychological variables are still not fully under- 
stood, but research and observation show that 
the following are influential: (a) loyalty to a 
given language or culture; (b) attitude(s) 
toward a foreign language/culture; (c) atti- 
tude(s) toward a foreign language/culture, 
either negative or positive; (d) motivation for 
studying the language, be it instrumental or 
integrative. 

4. Coincidental factor(s). such as family crisis, ill 
health, anxiety neuroses in test-taking, may be 
very influential. 

Implications: As the above constitute a minimal in- 
ventory of variables, it is already evident that varia- 
bles of performance are both numerous and. in 
many cases, factors or conditions not of the stu- 
dent> making. It would be impossible to design in- 
struction and assessment so that each variable 
would be explicitly taken into consideration. Ac- 
cordingly, as far as assessment goes, it is important 
to give each student sufficient opportunity to per- 
form at his/her optimum. 

Cluster.E: Desirability of Differing Options 

In general, a sociolinguistically trained person 
would avoid "all or nothing" solutions. Answers to 
language problems in a multicultural setting need 
to be developed from an examination of local condi- 
tions, if this is at all possible. Fortunately, language 
is a field in which wide variation exists, and options 
can vary greatly. To the extent possible, pupils and 
thetr parents need to have a voice in the following: 

1. Since the home language is already known, all 
efforts should be expanded in mastery of the 
dominant language (here. English). 

2. The choice can be a "monoliterate" one. in 
which the home language or dialect is utilized 
for discussion of cultural materials, but only the 
mainstream language is used for reading and 
writing skills. 

3. Instruction is given in the minority language, 
only for developing reading and writing skills. 
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4. A part of the curriculum might be devoted to 
the cultural background of the minority group. 

I 

5. A maintenance solutioiji. in which a larger part • 
of time is devoted to ihe' dominant language, 
may be used especially in scientific-technical 
areas. 

Implications: Strategies for dealing with any or all of 
the above solutions are to be found, although not 
always in a highly developejd manner, in the litera- 
ture of bilingualism. There is room for a great deal 
of experimentation, difference of "treatment." be- 
tween an experimental and ^ontrol group following 
a fairly simple research desigln. Many of the answers 
needed by educators havej simply not been re-' 
searched. One thing seems certain — the dominant 
home language usually utilized as a vernacular can 
and does often serve as a useful bridge toward in- 
creasing mastery of formal aspects of the home lan- 
guage. Nevertheless, this d^es depend upon the 
"distance" between formal, standard varieties and 
informal ones. If the distarrce \s too great, it may be 
best to treat the second langiiage as a foreign lan- 
guage. (Spoken Cantonese vs. Mandarin is a striking 
example. Even here a bridge is possible through 
writing and reading, since the .written characters 
communicate to any literate Chinese reader.) 

Cluster F: Community vs. Home^ Standards 

In general, there is. in most societies or regions, one 
language whose speakers fornni an elite. They are 
usually fully in the mainstream.! There "nay be any 
number of "other" ethnic groupi some even more 
numerous than the elite, with lin^ited access to the 
mainstream. \ 

1. Situations facing multilingual pupils vary great- 
ly in different countries and regions. The coun- 
try may have diglossia. that is a Low vs. High 
"form of the same language. 

. 2. What is the purpose of bilingual education, as- 
similation or maintenance of minority 
language? 

3. Does the minority community reject the values 
of the majority or dominant cultur^? 

4. How different is the variety of the dominant lan- 
guage taught in the school from th^t spoken by 
mainstream and minority speakers?\ 

5. Is there a wide separation between the variety 
of the minority language taught in School and 
what the children hear in their own bome and 

community? \ 

\ 

Implications: A wide spectrum of issues evoked 
by this cluster, and they are ones for whiifh there 
are no pat answers. Sociolinguistically. hbwever. 
since language varieties (language and d\alects) 
occupy differing communication roles, instr^uction 
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and testing can afford to vary pedagogic treatments 
and approaches to the testing of language 
proficiency ' 

Cluster G: Ten SOL insights for the Classroom 

More than any one of the ideological clusters 
presented, there is room for sociolinguistic 
consciousness-raising for all involved in the teach- 
ing and testing process. Given any reasonable soci- 
olinguistic exposure, an individual will not view lan- 
guage phenomena as a series of dogmatic 
oppositions, but rather as continua or., in practical 
terms, as a set of alternatives from which solutions 
can be chosen. The following items represent a 
sociolinguistic attempt to arrive at a sociolinguistic 
type of measure or better measures embodying 
SOL awareness. 

1 Teaching and assessment both should empha- 
size language performance in a variety of dif- 
ferent situations. 

2. Although there is a continuing need for numeri- 
cal assessment, performance-based criteria for 
specified situations need to be developed far 
mere than is the case present. 

3. An Indispensable comRonent of any language 
training program wgulq be a realistic orienta- 
tion to the phases bf the target culture which 
must be known in orderlto function in a given 
language without avoidable faux pas. Again in- 
struction and testing should be done through 
the presentation of situations. 

4. As part of the cultural component, there is 
need fpr training and assessment In kineslcs. 
e.g. the use of space, eye contact, gestures, 
etc.. since this varies from culture to culture. 

5. The notion of "only one form can b^ correct" 
flies in the face of SOL approaches suggesting 
artificial contexts for language, which hardly 
correspond to reality. 

6. Knowledge of the regional ethnic culture as 
well as the ancestral culture and civilization 
should not be assumed. Unfortunately, there is 
little agreement on the content of such a 
component, offering here a challenging field of 
research. Little research has been done on 
ways in which children learn and Internalize 
cultural knowledge. 

7. In view of the younger Sges of most bilingual 
pupils, ways should be sought to enhance the 
effectiveness of modalities. The serious draw- 
backs with paper and pencil and such measures 
as the draw-a-man are too well-known to need 
elaboration Closer to a naturalistic situation is 
the coordinated use of videotape and slides pre- 
senting realistic situations and coordinated 
with a synchronized tape or cassette recorder 
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8. A close tie-in should be sought between entry 
assessment, subsequent classroom Instruction, 
and follow-up assessments. Types of individual- 
ized instruction or small abllltyrgroup divisions 
can be useful here. 

9. Sociolinguistic insights from research ought to 
be combined with those from psychollnguistics 
(including neurolinguistics. pedagogy, and 
social anthropology) on a continuing basis. 
These can be incorporated into both assess- 
ment and instruction. 

10. SOL research data in "raw" form may be ambig- 
uous or misleading if brought directly to the 
classroom context. There is need for processing 
such Information, so that interpretations can be 
made relevant to the experience of both teach- 
ers and pupils. 

Implications: SOL research data for assessment and 
related language operations needs careful process- 
ing for the classroom context. 

D. Applying SOL Notions to Language Assessment 
and Relat@cK)^erations 

There are perhaps 1^ sub-fields of linguistics and 
social science which ar^^ amenable to classroom 
application as SOL. This is simply because the very 
essence is perceived In the variability of language 
and speech. Accordingly, it is light-y^ars distant 
from the prescriptive approaches toTangtwrg^ 
which attempt to promote the concept that there is 
almost always one. and only one. "best" or "most' 
correct" form In a language and that It should be 
used in all situations. I have termed this the "unitary 
fallacy." which Is replaced In SOL by the notion of 
"situational appropriateness" as the determinant of 
what is "best" to use. 

In second language teaching, we deliberately teach 
the most commonly accepted standard variety, usu- 
ally of a somewhat formal nature. That makes sense 
for the purpose. In blllngual/blcultural education, 
however, more often than not we are dealing with 
pupils who come to us speaking different dialects 
of a language and. by definition, somewhat non- 
standard varieties. As regards Spanish, which ac- 
counts for the majority of bilingual programs In the 
U.S.. there are three major varieties — Chicano. 
Puerto Rican, and Cuban Spanish, but even a much 
larger number if we'take each national form of Spa- 
nish American language spoken by immigrants 
from Columbia. El Salvador. Dominican Republic, 
Venezuela. Ecuador, and other nations. Just think of 
it. There are nineteen countries where Spanish Is 
the official language: no other language can claim 
such a number At any rate, it is precisely SOL . 
which prepares us better than any other study to 
cope with the variation. We ourselves cannot be 
masters of all varieties, no more than we can be of 
all varieties of Portuguese — European. Brazilian. 
Azorean. colonial African, and U.S. The result is that 
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we program the most versatile of language teaching 
computers — the human being — to deal with vari- 
ation so that the students can escape feelings of In- 
feriority because their speech is different and not 
standard and so that we can put existing language 
skills to use as a bridge to the standard variety. This 
goal is overwhelmingly approved by parents and, at 
the best- is merely giving Individuals another option 
to enr ich their linguistic and bilingual repertoires. 

Among the linguists who have specifically ad- 
dressed assessment as well as a broader spectrum 
of bilingual education Instructional issues, are 
Rosana Sanchez and Eduardo Hernandez-Chavez. 
Their ideas are of special relevance for us (Sanchez 
et al. 1978: 7-63). Professor Sanchez holds that a 
satisfactory assessment battery must address skills 
in the areas of linguistics, cognition, and soclo- 
linguistics. The linguistic dimension calls for an 
examination in the students' language varieties of 



their ability to communicate in and conceptualize 
about topics and situations natural to their expe- 
riential domains. Once these skills are determined, 
the task of the school will be to increase student 
repertoires so that they may express themselves In 
more than one variety of a language when it Is ap- 
propriate to do so. This means, frankly, code- 
switching during interaction with ethnic peers 
since this is the normal way to converse. Cognitive 
skills in computation, abstraction, or sythesis must 
also be assessed as they relate to academic, home, 
or sociocultural settings. All these skills must be as- 
sessed in both languages so that we may be able to 
determine the students' proficiency and enable 
them to develop their primary language stage of ab- 
stract conceptualization (Sanchez.'1978: 51-52). 

In order to illustrate Professor Sanchez's blueprint 
for assessment, we will reproduce here two dia- 
grams detailing her plan. 



LANGUAGE SYSTEM 



Expression' 



.Content, 



' Referent ~ 
Socio-Cultural Reality 
(Language Use) 



^Cognltlve^ 



Xulturah 



Skills 



Application 



Denotations 



Connotations 



Grammatical 
Competence & 
Performance 

Oral or Written 



DIAGRAM I 



Sanchez, 1978: 17 
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DIAGRAM n 
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THE SCHOOL COMMUNITY 



LANGUAGE VARIETIES - 



SOCIOCULTURAL 
PROFILE 



SOCIO-ECONOMIC-CULTURAL 
SITUATION 
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CHARACTERISTICS AND 
EXPERIENCES 







ASSESSMENT OF 

LINGUISTIC-COGNITIVE 

DEVELOPMENT 



ASSESSMENT 
OF LANGUAGE 
PROFICIENCY IN 
BOTH LANGUAGES 



ASSESSMENT OF 
COGNITIVE 
SKILLS IN BOTH 
LANGUAGES 



DEVELOPMENT OF 
SKILLS TO 
SYMBOLIC STAGE 
IN PRIMARY 
LANGUAGE 



CURRICULAR 
PLANNING 



CURRICULUM 
IN BOTH 
LANGUAGES 



z 



DEVELOPMENT OF 
SECOND LANGUAGE 



CLASSIFICATION OF 
STUDENT IN TERMS 
OF PRIMARY 
LANGUAGE AND 
COGNITIVE LEVEL 



Sanchez. 1978:15 
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Eduardo Hernandez-Chavez (1978:47-56) likewise 
would consider that there is much <ponn for im- 
provement of assessment ins,truments; although he 
himself is coauthor of the Burt-Dulay Bilingual 
Syntax Measure, His ideal instrument would seek 
separate ratings for each of the following dimen- 
sions: (1) linguistic. (2) sociolinguistic. and (3) meta- 
linguistic skills. These would be accompanied by 
specific comments regarding each individual's per- 
formance, addressing such items as degree of stan- 
dardness vs. nonstandardness in the speech output. 

As regards the three dimensions mentioned above, 
the linguistic one involves performance in the read- 
ing, speaking, writing, and listening skills of the two 
languages. The sociolinguistic parameter would, in 
Hernandez-Chavez's view, seek to determine the 
pupil's proficiency as well as dominance in the 
separate skills of each language and in the domains 
^ familiar to him or her. Taken into consideration 
would be the fact that ethnic peers normally code- 
switch with one another, while with a monoglot 
Anglo he would resort to such devices as intona- 
tional shifts, gestural patterns, special lexical 
choices, and the like. Quoted aptly. Professor 
Hernandez-Chavez warns that there is hardly ever 
an equal commensurability of language use with 
any two domains in two different languages. This is 
evidenced by such speech acts as asking for a 
favor, a problem that is hardly ever solved by 
translating words directly from say English to Spa- 
nish, or vice-versa. It is. in the final analysis, the cul- 
tural values of the speech community which ac- 
count for the shape that such a formula should take. 

With respect to the metalinguistic dimension, this 
refers to overt conscious knowledge about linguistic 
phenomena, such as grammar and orthographic 
rules, as well as the meanings and values attached 
to regional, social, or literary variants of grammati- 
cal or vocabulary forms. Obviously, if a student is 
unaware of such distinctrons. he would have real 
trouble in interacting with other people oMhe-©til§r > 
culture, no matter how* impressive the linguistic 
component may be in his case. 

The literature of bilingual assessment is by now 
considerable, although time and space forbid our 
doing justice to the entire gamut of experience and 
opinion. That SOL. although widely recognized as 
"important." is poorly or not at all integrated into 
most assessment instruments becomes apparent 
upon even a cursory examination of commerically 
available instruments. 

An examination of the extremely useful 
compilation. Assessment Instruments in Bilingual 
Education: A Descriptive Catalog of 342 Oral and 
Written Tests, prepared by the Northwest Regional 
Educational Laboratory (NDAC. 197B). helps us 
gam a perspective on the extent to which SOL has 
influenced such instruments. For excellent reasons, 
the instruments reviewed focus on language profi- 
ciency sometimes in one, sometimes in two 
-| languages, with little overt signs of the presence of 
SOL The fourth section is titled: "Sociocultural. 



English. Spanish. Other Languages." A total of 41 
instruments are described, measuring aspects of 
students' environment and their attitudes toward 
their environment, particularly the school. Language 
attitudes are also measured by some tests, while 
certain ones also offer parental attitude measures. 
These factors are certainly aspects of SOL. but soci- 
olinguistics goes beyond them. At the same time, 
the attitudinal dimension is also within the purview 
of psycholinguistics and even education theory. 
There is no way to escape the fact that SOL is inex- 
tricably interrelated with other cognate disciplines. 
All in all. this represents positive steps forward by 
comparison with the state of the art in 1968, ten 
years before this compilation appeared. A briefer 
survey can be found in Duncan and De Avila 
(1979:15-50). 

At the same time, a consideration of only commer- 
cial and published assessment instrunrients offers a 
skewed view of practices in the field. A survey of 
what is actually employed by bilingual projects for 
measuring proficiency/asSessment, bas6d in part 
on a sample questionnaire, hias been carried out by 
Stan Seidner In his paper, "Establishing Entry/Exit 
Level Criterra for Language Asse^ssment," presented 
June 5, 1981 at the Ethnoperspectiye Forums, East- 
ern Michigan University (proceedings in 
preparation), Seidner found that in over 66 per cent 
of project directors responding, the practice was to 
use several instruments — representing both com- 
mercial and self-prepared measures. Additional 
light will be thrown upon this matter at the Lan- 
guage Assessment Institute, June 17-20, 1981 at 
the National College of Education, Evanston, 
Illinois. (This institute was organized by the School 
of Continuing Studies of that College, with the 
cooperation of the Illinois State 8oard of Education, 
8ilingual Education Service Center, Educational 
Testing Services and National Clearinghouse for 
8ilingual Education.) 

There is no better way to end this section than to 
underline the extremely difficult task of embodying 
SOL in assessment instruments, to a large extent 
because it is difficult to gain consensus on what 
shape this should take. In a paper also presented at 
the Ethnoperspectives in 8ilingual Education Re- 
search Forum on 8ilingual Education Technology, 
Eastern Michigan University, entitled, "The Assess- 
ment of Language Proficiency in 8ilingual Chilren: 
An Analysis of Theories and Instrumentation* by 
Charles Stansfield, the difficulty of doing justice to 
a test based on a tripartite theory of language 
component, communicative skills, and sociolinguis- 
tic domains is detailed. As Stansfield worksjt out, it 
involves a matrix of 32 different cells, each of 
whose tasks require separate validation. All in alL a 
challenge of utmost complexity is involved. In the 
discussion period referring to Stansfield's paper, he 
went so far as to claim that instruments which 
would attempt to specifically address each aspect 
described above would probably become too un- 
wieldy to administer and analyze (Stansfield 1981, 
personal communication). 
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Thus, we are on the horns of a dilennma. and with 
each effort to expand the scope and depth of as- 
sessment instruments, the complexity of such a 
task IS greately compounded. At the same time, 
many of the thorny issues are illuminated in the 
other papers of this volume, covering papers pre- 
sented at the National College of Education Lan- 
guage Assessment Institute (Seidner. ed.. 
forthcoming). 

E. SOL as Dynamic Heuristics — Personal and Group 
involvement 

As already suggested. SOL offers no panacea capa- 
ble of solving ail educational problems. What may 
be considered its main terminal objective is that of 
sensitizing the individual to discriminate among the 
language varieties and to be more aware of the exis- 
tence of options in the world of language and 
speech. This, far more than the sum total of facts 
learned in Such .training, is what makes it valuable 
sin^e of all human activities, there is hardly one in 
yvnich we engage more than speaking and listening. 
From learning about "language situations" the 
world over, the student of SOL usually develops the 
ability to make judgments about the linguistic situa- 
tions which s/he encounters at home, at work, and 
at school and elsewhere. One of the most common 
exercises in SOL courses is the drawing up of a SOL 
profile of a country, region, or town. Often after stu- 
dents have completed such a profile of their own 
home town, they will exclaim. "My. I never realized 
that we had such different kinds of people, and so 
many relationships among them." This is quite un- 
derstandable in countries like ours in which the 
goal of egalitarianism is reflected in such slogans 
as: "Tall people are equal." or "There's no discrimi- 
nation in this town." Nevertheless, as George 
Orwell wrote in his classic 1984. some are more 
equal than others. Even in a democracy, role, power 
and functional relationships are operative, and 
these can be identified in any SOL course worth its 
salt. 

Ideally. I would personally present the relationship 
of SOL to language assessment semiotically in the 
three following patterns: 

SOL and Teaching/Assessment Models 

1 Formal/tradilional 



SOL 

□ 



2 Mixed 
SOL 



□ 



Teaching Assessment 

□ □ 

Teaching Assessment 

□ □ 



3 Vrjrtdblf? SOI Oriented 

SOL Teaching Assessment 



□ 



□ □ 



Everything considerisd. then, the main goal of SOL 
aside from the actual data presented, is the develop- 
ment of attitudes which permit the individual to 
make decisions about language options in a world 
where some 250 sovereign nations house more 
than 5.000 distinct languages and an almost infinite 
number of dialects and sub-dialects. Stated another 
way. in modern technological terms, the most 
desirable terminal objective or criterion-referenced 
goal is to "program" individuals, be tliey students, 
teachers, administrators, parents or (Community ac- 
tivists. This must be done sociolinguistically so that 
they are liberated from our traditional h^avy load of 
myths, misconceptions, and prejudices about other 
languages and cultures and can make enlighted 
and practical language-cult^re decisions. Having 
given some notion of the virt^jal futility of explicitly 
incorporating SOL into all teaching techniques and 
materials, the alternative is to program the most 
versatile of computers — the human being. 

Indeed, in her fine review of foreign language test- 
ing. Helen Jorstad (1974:223-273) emphasizes the 
extreme difficulty, if not impossibility, of adequately 
testing the cultural component of language. The 
tester is trapped between the Charybdis of selecting 
an inventory of cultural items about which no two 
persons usually agree and the Scylla of involving 
value judgements of a subjective type. What should 
cultural or sociocultural teaching and assessment 
materials and instruments contain? It is a wide- 
open and moot question. 

One of the highest priority problem areas that have 
already received a certain amount of Mtention is 
that of attitudes toward other languages and cul- 
tures on the part of youngsters at different Ftages of 
development. Most of the research on this, as ex- 
emplified by a brilliant group of Canadian scholars, 
has been in the form of the matched-guise tech- 
nique, mostly applied to young adults. There is a 
desperate need, particularly in the bilingual/bicul- 
tural schooling context, for investigations that will 
illuminate the ways in which young Americans ac- 
quire their abysmally negative perceptions of other 
languages and life styles. It is too simple to say 
merely that it is all the fault of the parents. Unfortu- 
nately. U.S. social history reveals in our society 
deeply rooted hostilities toward things "foreign." 
call it jingoism, chauvinism, know-nothingness or 
whatever. These are compounded by now outmod- 
ed preceptions that foreign language/culture study 
has little or no integrative value. A child interested 
in language study is marked as a "lame" by his 
peers and may drop it hastily. As one who spent 
most of his life trying to teach and inspire red- 
blooded American youngsters with my own passion 
for our subject. I threw up my hands after 25 years 
or so of the effort and went into research, lecturing, 
and consulting. Please don't judge me too harshly! 
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The results of our mercilessly competitive interna- 
tional, front has been the retention of a "Let'em 
Learn English" attitude and the image of the ethno- 
centric Ugly American. A recent Russian immigrant 
told me in El Paso that when a Soviet serviceman 
presented himself at the U.S. Embassy in Kabul. Afg- 
hanistan, wishing to defect to the U.S.. there was 
not a single person in our staff there able to speak 
Russian to him. and a special plane had to be dis- 
patched to another capital, almost a thousand miles 
away, to bring a trained polyglot. Of the fifty-odd 
hostages formerly held in Iran, only one or two 
could speak Farsi with any functional level. 

This problem cannot be solved, despite our fondest 
dreams, by trying to devise still another break- 
through, complete with computer intervention for 
teaching methodology or testing. A two-pronged 
attack is needed that will research 'the anti- 
language attitudes and their development and, as a 
second phase, will undertake an aggressive but in- 
formed campaign supported by educational, com- 
munity, and related agencies. This would be in line 
with the American way of influencing 'Our institu- 
tions, effectively applying presS^ure to the proper 
governip'^ntal and political organizations. Unless 
this, howevv-^r. obtains wide grassroots approval, it 
will fail as have such movements as FLES ~ Foreign 
Languages in the Elementary School ~ which 
sparkled in the 1 950's. only to eclipse in the 1 960's. 

As this section does not purport to being a compre- 
hensive treatment of research targets in socio- 
linguistic aspects of assessment, it will suffice to 
add only that the writer sees two additional broad 
areas which he would typify as highest priority on 
our agenda for today and tomorrow. 

The first area is in the realm of interdisciplinary r 
search — precisely at those points where SOL inter- 
sects with psychology, sociology, social anthropolo- 
gy, psycholinguistics. neurolinguistics. and speech 
pathology. The latter field has become unusually 
open to current trends from linguistics and indeed 
has begun to share with us some of its important in- 
sights, particularly as there are problems of varia- 
tion in styles of speaking and performance involved. 
The old days of relerating speakers of ethnic dia- 
lects to retarded classes, special education, are un- 
fortunately not completely behind us. There is 
much more. 

The second area may be seen in the still underdevel- 
oped activity of data-gathering and research at the 
micro level. Language practitioners usually emerge 
from backgrounds in which they have not been ex- 
posed to even the simplest social science research 
procedures. The awe which then results from the 
medium and terminology of statistics, with its bell- 
shaped curves, distinctions between the mean and 
the average, levels of confidence, chi-squares and 
regression analysis is well-nigh paralyzing for us. 
Yet at the same time, as Cynthia Darche-Park point- 
ed out at the 1981 Ethnoperspective Forum in 



Ypsilanti. ILujan & Park) teachers can do important 
types of data gathering which, if properly 
processed, can supply insights not available to an 
outsider with the most sophisticated statistical 
techniques. This is not to attack the validity of re- 
search which employs modern procedures, but 
rather to suggest that there is room for both types 
of inputs. Due in part to the great speed with which 
bilingual education schooling has been organized, 
many practices today — out of sheer necessity — 
are performed in an a priori way. There is no time to 
carry out large longitudinal studies when govern- 
mental mandates and requirements demand 
immediate execution of measures designed to 
relieve disadvantagedness on ethnolinguistic 
bases. No aspect, you will agree, has been quite so 
heavily "under the gun" as the requirement for 
producing instruments or measures for assessment 
of language proficiency and dominance. 

Encouragement of micro-level research can and 
should bring us a great deal of data and figures and 
insights not yet available — all of which are badly 
needed. I am saying here that although focused re- 
search projects are essential, ethnographic or 
broad-gauge investigations can also, by serendipity, 
make us more knowledgeable about our communi- 
ties and the characteristics which distinguish our 
students, parents, teachers and the community in- 
frastructure. Much of what we believe about our 
field is simply based on hearsay given authority by 
dint of repetition. We must open ourselves to learn- 
ing more about a topic such as bilingualism. As one 
of the pioneers. Einar Haugen. has said, in it every- 
thing and anything is possible. A tiny example will 
suffice here. In his important study on "The Neurolo- 
gy of Bilingualism." Yvan Lebrun (1976) brings con- 
vincing evidence that the long-held belief that 
when aphasia caused by a trauma occurs, the 
strongly embedded mother tongue will — if any- 
thing — return at least in part to the patient, while 
the language learned later will recede or disappear 
Research on specific cases shows quite the oppo- 
site frequently to occur. 

F. SOL Research at the Local Level 

The remainder of this paper will concern itself with 
various specific types of research which can be per- 
formed with very small or no monetary outlays, at 
the community level. As Cluster G.^'^'ltem 10 sug- 
gests, the data gleaned c^n be extremely useful as 
a continuing data bank for bilingual/bicultural 
education. 

The experience on which the following Is based 
was acquired with a Sociollnguistic Research 
Team, which was active at the University of Texas 
— El Paso between 1969 and 1979. mostly on a 
volunteer basis involving university faculty from dif- 
ferent departments. Minimum requirements for 
such a team are at least one language specialist 
* and one social scientist or educationalist familiar 
with standard social science research techniques. 



In 1969 several of us colleagues at UT-EI Paso 
decided that we needed to know more about our 
college population to see whether there were sig- 
nificant differences between the Anglo and the Chl- 
cano bilingual students, comprising more than a 
third of the undergraduate enrollment. Our Socio- 
Jinguistic Research Team consisted of an education- 
al psychologist, an educational administration 
specialist, a sociologist and myself as a linguist. Our 
first task was to devise a Sociolinguistic Back- 
ground Questionnaire (Brooks, Brpoks, Goodman & 
Ornstein. rev. 1972) which was then administered 
to the 301 subjects (students), stratified into sixteen 
groups, including Mexican-American vs. Anglo, 
age. sex. year of college, college enrolled in. etc. 
This quota-type sampling amounted to almost 5 
percent of the undergraduate enrollment that year. 
In addition, a similarly stratified sub-sample of 30 
bilinguals was taken, who. in addition to the above, 
were subjected to oral and wntten linguistic elicita- 
tion as Part B of the questionnaire. The basic 
questionnaire, with its 106 items, mostly of true- 
false type, covered demography; self-reports of pro- 
ficiency in Spanish vs. English, as well as domains 
of usage: and attitudes to the respective languages 
and cultures, as well as questions on work ethic, life 
goals, life styles, etc. 

Results of the survey have been described in detail 
through publications by different team members 
and by appeahng in linguistic, social science and 



education journals. (References to some of these in- 
clude: Murray. 1972; Dubois. 1979; Goodman and 
Brooks. 1975; Goodman and Renner. 1971; Brooks 
and Calkins. 1981; Ornstein 1971. 1973. 1975. 
1976. 1976b. 1979; Ornstein and Goodman. 1979; 
and Bowen and Ornstein. 1 976.) 

Depending upon local needs, a sociolinguistic re- 
search team. In addition to gathering and collating 
raw data in different categories, decided upon par- 
ticular targets with the greatest possible relevance 
to bilingual/bicultural needs. Research monies 
were subsequently obtained. By way of example. 
Jaime Lujan and Cynthia Darche-Park have de- 
scribed such efforts in their paper "Collaborative 
Staff Development for Teachers of Bilingual Stu- 
dents with the Teacher as Researcher" (1981). They 
remark: "The notion that classroom teachers are in- 
capable of doing important educational research is 
a bias inherent to research training in many tradi- 
tional research institutions." 

In the UT-EI Paso survey we were mostly interested 
in identifying key sociolinguistic characteristics of 
Anglo vs. Mexican-American students in our highly 
bilingual enrollment Accordingly, our products fol- 
lowed this orientation. One of the most valuable 
outcomes was a Language Situation diagram of the 
entire Southwest, largely applicable to the El Paso. 
Texas-Las Cruces. New Mexico area, (see Diagram 
4). 




REGIONAL VARIETIES (DIALECTS) 



S Colorado- 
N. New Mexico 
Spar.sh 
Koine 





General 




Southwest 




Spanish 



Indian 




German 




Lubbock- 


Dialects 




Polish 




Amarillo 




Arabic 




Texan 




English with 
Predominantly 
Northern or 
Southern 
Characteristics 



General 
Southwest 
American 
English 
Koine 




SPECIAL CODES (JARGONS. CALOS) 



/TeenageX /' 
N^panisy \ 



Street (and Underworld) 
Spanish Koine or Argot 




Occupational 

Spanish 
(agriculturalists, 
smelter workers, 
cattlemen, etc.) 



/OccupationalX /'Underworld'X /TeenageX 
\ English / \ English / \ English / 
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phonological and grammatical variants in the Spanish 
and English of Southwest Chicanes. For pedagogic pur- 
poses, it is best to avoid too much technical jargoh. 





Spanish 


English 


1. 


/ch/ and [sh] alternation 


1. /ch/ and /sh/ alternation ^' 


2. 


hi with retroflex interference from English 


2. /b/ and /v/ alternation 


3. 


/x/ with glottal interference from English 


Q lel Qe ro9li79tlrtn of /y/ 


4. 


Realization of orthographic 11" (sia for silla) 


4. Consonant cluster reduction ar^idf secondary effects 


5. 


Epenthetic /e/ in certain environments (words with 


5. Realization of /O/-/ / and /t/-/ / 




final Stressed syllables ending in 1, n, r, aS Isabele. 






comere) 




6. 


l\l for /e/ in final fiosition following /ch/ Inochi) 


a [dzh]for/y/ 


7. 


Paradigmatic leveling of /e/ to i in infinitive and 


7. Realization of /iy/ 




finite forms of //verbs Ivlstlr, vistimos) 




8. 


Reduplicated plurals, (cafeses) 


8. Realization of /ae/ 


9. 


Overriding English intonation 


9. Stress mislocation. particularly in compounds wfth 


nouns serving adjectival functions 






10. Overriding Mexican Spanish intonation patterns 




Diagram 5 : Inventory of Variables of Southwest Spanish and English 



In line with the above, this worker has been developing 
a sociolinguistic marking system, first presented at the 
12th International Congress of Linguistics* Bolongna. 
and subsequently revised. As the particular bilingual 
classes are a battleground for what forms and usage to 
follow, some system of classifying standard vs. nonstan- 
dard forms in language can be very useful. Along socio- 



Another valuable type of product that can be furnished 
by a SOL Research team consists of inventories of vari- 
ants in local-regioral speech. Diagram 5, prepared by 
the El Paso project, shows constantly updated lists of 



lingui§tic principles, every attempt is made to present 
usage as a set of options, depending upon situation and 
function, rather than attempting to establish one inflexi- 
ble "unitary" form, which is unrealistic since students 
will revert to colloquial patterns as soon as they leave 
the classroom. The SOL marking system devised can be 
seen in Diagram & 



Diagram 6 
A Tentative Sociolinguistic 
Marking System 



I Marking System 

— 4^ — ^ 

Symbol Explanation Markedness Symbol Explanation Markednass 

(According to Dimension) (Accoramg to Dimension) 



ST STANDARD 

fo formal 

pr prescriptive ^ 

no normative, often of "school variety" 

sa sanctioned by recognized authority (academy, etc.) 

of official 

na national 

re religious 2 

ac acrolect(al), higher prestige variety 

sti stilted 

ru rustic, rural 

jou journalistic 



NS NON-STANDARD 

St stigmatized 
ar argot, calo 
pa patots 

ja jargon, specialized code 3 
ba basilect(al), lower prestige, stigmatized variety 
st slang 
ca cart 
cr Creole 

pi pidgin 4 

cd contactual or bilingual dialect interferential variety 

if interferential feature or item , usually not assimilated 

within bilingual community 
af assimilated feature or item, integrated w'thin 

bilingual community 
lo loan, feati^re or item often not part of standard 

language 

hy hypercorrect, to point of non-standardness 






re 




• 


reg 




rep 




so 




vu 




po 


+ ■ 


Ob 




obs 




arc 




hf 




If 


+ 


wp 


+ 






ta 


+ 


su 


+ 




in 




un 




ss 


+ 


ch 


+ 




pe 


qu 



NE 


NEUTRAL (STANDARD OR NON-STANDARD) 




la 


language 




di 


di3lect(al) 


+ 


le 


lect, isolect (replacing dialect^) 




va 


variety 




ve 


vernacular, "unmonitoreri" style of speech 




sc 


specialized code, e g. teenagers, occupational 




sty 


style (write: sty + des) 




CO 


code (write co + des.) 


+ 



St 

SES 

ag 

se 

CO 

au 
at 

IS 

et 
II 

or 
wr 
te 

no 



regional, geographically determined (write first 2 

letters of country or region, regist 3 of la or di. e.g.. — 

FR - France; FRE - French) ±. 

register (write: reg + des.) + 
repertoire " totality of styles or "codes" of 

language(s) controlled by an individual + 

social + 

vulgar + 

pornographic + 

obscene + 

obsolete, obsolescent + 

archiac +. 

high-frequency item jOr feature + 

low-frequency item or feature + 

word-play usuaHy Involving transposed » 

phonotactics, iS.g. Pig Latin + 

taboo + 

status uncertain, lacking further data + 

informal^/ + 
unstable, fluctuating, neologistic. dissapoearing. etc. + 

semairirtic shift, often away from ST + 

chan6|ing, shifting; indicate direction with arrow t 

petrified, fixed, often archaic form + 

quasi + 



SITUATIONAL FACTORS OR DIMENSION 

socioeconomic status (of speaker) 

age. age group (write: ag + des.) 

sex (write: se + des.^ 

context (write: co + des ) 

audience, interlocutors (write: au + des.) 

atmosphere (write; at. + des.) 

isolatmg from "mainstream" (write: is + des.) 

ethnicity, affiliation to ethnic group (write: et + des.) 

language loyalty (write: II + des.) 

primarily oral usage — 

primarily writtan usage + 

tempo, speed of delivery (not normal; write: te + + 

des.) 

notabilia, special circumstances or details to be 
noted (write: no + des,) 
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Students can ultimately learn to make distinctions 
themselves regarding the sociolinguistic concepts 
of "standard" vs. "nonstandard." "formal and infor- 
mal." "written and spoken" and "informal and calo." 
In the U.S. Southwesl. the Pachuco calb. a street 
variety of former delincjuent zoot-suiters. is often 
confused with" Southv^est Spanish and "border 
jargon" or "Te)<-Mex" Sociolinguistic ignorance 
caubes these distinctions to be blurred or confused 
when the fact is that al] these styles of a language. 
Including slang which is quite limited in actual lexi- 
cal quantity, have their functions in certain con- 
texts. The rich and colorful Pachuco c^lb has in fact 
become part of racy, informal Southwest Spanish, 
used for humorous or often identificational 
purposes. 

Instead of the fruitless hammering away that there 
Is one and only one "correct" form of Spanish, or 
whatever language, bilingual teachers can and 
should assist their pupils In perceiving the vast rich- 
ness and versatility of the great culture-languages, 
such as Spanish and English. Children ar^ sociol- 
Inguistically programmed from the very beginning 
of their existence and possibly as early as4oJitero^ 
Unless much of the confusion is sorted out. the vari- 
ation becomes a burden, rather than a blessing. 
Thus in concrete terms, for example. Southwest 
blllnguals can. as a plassroom exercise, "sort out" 
with the teacher such co-occurring forms as the fol- 
lowing and mark them sociolingUistically. 

Calb islreet language) 



Formal 

si 
no 

automovit 
ya partib ei senor 
Dbnde vtve? 



Standard (Formal 
& Informal) 

si 
no 

coche. carro (informal) 
Ya se fue el hombre 
Dbnde vives? 
or 

Onde vtves? (more inf.) 



stmbn 
nets, chaie 
ranfta 

ha se borjb ei bato. 
Ofide viboras?" 
(word play) 

•'snake' - SW Spanish 



b. Nonstandard features of the English em- 
ployed by different socioeconomic groups. 
* c. Interference vs. integrated norms. 

(2) Socio-Attitudinal Data 

^a. Attitudes toward different varieties of re- 

" gional Spanish (other) and English and de- 
grees of "language loyalty" or commitment, 
b. Attitudes toward comparative life-style 
and value systems of Hispano and Anglo 
populations (or other). 

(3) Bilingual Communication Data 

a. Patterns of linguistic dominance. 

b. Distributive role of Spanish vs. English in 
regional communication networks. 

c. Range of codes, styles, registers employed 
and by whom (i.e. typical "linguistic 
repertoires"). 

d. Code switching and stylistic shifting. 

(4) Socio-Educational Data 

a. Community profiles. 

b. Language/culture attitudes of community, 
parents, school personnel toward aims and 
methods of teaching of bilingual larf^pgage 
pair (Spanish vs. English). \ 

c. Testing and assessment procedures\ aind 
their special problems as applied tb all 
types of measurements, formal and inforr 
mal, and aimed at compliance of placej 
ment, exit criteria, improvement of skills 
and fluency improvement (proficiency). \ \^ 

The following diagram (7) then presents the flow-j 
chart of the regional or greater urban sociolinguispc| 
research team network. j : 




Elsewhere (Ornstein. J975:'40-42) the writer de- 
scfibes in fuller detail the organization and function- 
zing of \egional sociolinguistic research team pet- 
works. As noted, shrinking budgets at all levels of 
ipational establishmentlmake it a question 
of shea- survival to develop strategies for maximiz- 
ing funding and individual resources. One of the fea- 
tures Qffthe autt)Or's concept. :which has been com- 
mentedJupon favorably by Cohen (1975:3-4) is that 
of the Sociolinguistic Research Kit. to be distributed 
to collaborating investigators, detailing minimal 
and similar targets for data collection under the fol- 
lowing four rubrics: (D Linguistic. (2) Socio- 
Attitudinal. (3) Bilingual Communication and (4) 
Socioeducational Data/Insights. 
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(1) Linguistic Data 

a. Regional and nonstandard variants of Span- 
ish (or other language) in the fields of 
phonology, grammar and lexicon, accord- 
ing to use by different socioeconomic 
stratas 
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Diagram? 

Proposed Regional/Metropolitan Socioiinguistic 
Research Team Network (U.S. Southwest) 



Linguistic 
Data 




Bilingual 
Communicatjon 
Data 











Socio- 
attitudinal 
Data 



Technical 
Description 
Linguistic 
Grammar 
Sketches 




Socio- 
educational 
Data 



Southwest Socioiinguistic 
Research Units 



Short-Term Results 




Long-Term Results 





/■ 



Lari 


guage 


Cc 


intact 


Uni 


vrersal^ 



Taxonomy 
of Southwest 
Language 
Varieties 



Instruments 

Bilingual 
Profile Index 



Basic Research 

Data 
(stored in info, 
retrieval facilities) 



Re-Orientation 
Efforts 
(output thru lecture 
and publication) 




Span|shr^English 
leachjlng materials / 
(enrichment/resea^roo) 
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DESTINATION (USERS) 

1. ^Tea'^hing Profe^ion, 

esp. Bilingual^ducation 

2. Scholars 

3. . Social-Medical 
Delivery Services 

4. Governmental 
Agencies 



In conclusion, the example of the Southwest U.S.. 
an area marked by vast spaces and often limited 
population and resources, in pooling expertise in 
teaching and research may be illustrative. Since 
1972. beginning at the University of Texas-El Paso 
and meeting in a different site each year the 
SOUTHWEST AREAL LINGUISTICS WORKSHOPS 
(SVNALLOW). have provided forum for presenta- 
tion of pedagogrc and scholarly papers, published 
yearly in the SmUOW PROCEEDINGS. The latest 
may be seen in Elerick. 1 981 . 

No matter how tempting are the ideological dis- 
putes in our field we must, without abandoning 
theory, cooperate to the utmost with classroom 
practitioners forced to produce or apply instruments 
ifor proficiency and other aims upon short notice. As 
academicians, it is our obligation to provide a solid 
research base without which tests will result in in- 
evitable disasters. 

Summary 

In line with the spirit of' SOL itself, we have in the 
preceding pages, largely addressed the spirit of this 
sub-field, which is larger than the collection of facts 
and assumptions usually taught as "sociolinguis- 
tics." At least two notions override and pervade this 



study; (1) Language is inherently variable — not 
rigid and inflexible; and (2) Appropriateness of situ- 
ation or context is a safer guide than the agonizing 
search for the unfailingly "correct" form. Applying 
SOL to assess the preparation and administration of 
instruments is a mighty challenging task, and 
utmost caution is recommended for a variety of rea- 
sons or else a top-heavy Trankenstein" type of test 
whose complexity would be self-defeating will 
emerge. To the exjent possible, instruments should 
be so contructed that some allowance is made for 
varying options. Above all. testers, bilingual teach- 
ers and administrators, should be programmed socl- 
olinguistically to provide the surest guarantee 
against excessive elitism and prescriptivism. At the 
same time, all-out efforts should be made to utilize 
available grant funds and to mount sociolinguistic 
research teams, consisting minimally of a social 
scientist and a language specialist, preferably 
within regional team networks, addressing similar 
targets. An interface needs to be developed beyond 
its present stage, which would "package" SOL re- 
search data gleaned and whose products would be 
particularly aimed at the classroom teacher and 
tester — our first line of defense and progress in the 
bilingual/bicultural education movement. 
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SOCIOLINGUISTIC FOUNDATIONS OF LANGUAGE ASSESSMENT: 

A REACTION 



by 

Guillermo Duron 
Governors State University 



The purpose of this paper is to react to the material pre- 
sented by Dr. Jack Ornstem-Galicia relative to the socio- 
iinguistic foundations of language assessment. Dr. 
Ornstem-Galicia's presentation was organized into 
three areas. The first dealt with a historical perspective 
of sociolinguistics. This section included cumulative de- 
veloprjiental definitions. The second section dealt with 
the theories underlying sociolinguistics. Included in this 
section are the ideas of the contemporary theorists from 
which the discipline of sociolingustics has evolved. The 
final section of Dr Ornstein-Gallcla's presentation dealt 
with the contribution of sociolinguistics to language as- 
sessment. Specifically addressed were issues related to 
bilingual bicultural education and ethnic studies. 

According to Dr Ornstein-Galicla. sociolinguistics is a 
sub-branch of both linguistics and sociology. It has 
been called the sociology of language. Furthermore, 
sociolinguistics is viewed as using sociological methods 
and concepts to analyze the linguistic patterns of 
groups of individuals. Sociolinguistics assumes that lan- 
guage can only be understood and have meaning when 
it is examined in light of the social groups utilizing it. 
Language, therefore, is not fixed in time and space; it ts 
dynamic and ever-changing. The variability of language 
is the subject of sociolinguistics. Noteworthy is the fact 
that over 5.000 languages and over 1.000 dialects are 
spoken. 

Histoncally. sociolinguistics evolved for pragmatic rea- 
sons Sociolinguistics came into focus as a result of the 
increase m the number of people who travel extensively 
and the need for communication and understanding for 
professional reasons. Sociolinguistics tries to negate 
the idea of language as an elite concept, that is. it 
avoids the conclusion that there is a "pure" language. In 
rejecting the elitist point of view which concludes that 
there is only one correct vanation o{ language, it has 
introduced the concept that appropriateness of lan- 
guage IS relative. Since the purpose of language is com- 
munication. It must be situation-specific. An example of 
this is demonstrated by the following which depicts a 
situation and an inappropriate linguistic vehicle to ac- 
company It. 



Situation 

• In a bar in the ghetto 

• Inthebarno 

• Job Interview » 



Inappropriate Linguistic Vehicle 

• King's English 

• Academia Real Spanish 

• Vernacular Speech 



While any of the above linguistic vehicles are appropri- 
ate withm a given situation, they may be deemed inap- 
propriate based on the specific contextual environment 
listed above This illustrates the situation-specificity of 
language 



Several 
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Q ^cvcioi major sociohnguists were 
Drnstein-Galicia as contributing their 



referenced by Dr 
ideas, philo- 



sophies and theories to the emerging field of study. Fish- 
man defined sociolinguistics as the structure that tells 
us who speaks which language to whom. when. why. 
and where. This theorist described sociolinguistics as 
responding to a need for an organized discipline to re- 
place a vacuum that was occuring in the area of 
linguistics. Due to the influence of philosophy, trans- 
formational grammar, etc.. linguistics was becoming a 
science. Other sociolinguists mentioned included 
"purists" such as Chomsky and Haugen and 
"interpreters" such as Blumefield and Brite. 

Considering the cultural milieu that existed while sociol- 
inguistics was developing, the time was right for its con- 
ception. In 1960. Title VII of the Elementary and Secon- 
dary Education Act was established: in 1964. the Civil 
Rights movement was underway; and in 1974. Lau vs. 
/V/c/?o/s called for equal educational opportunity for stu- 
dents from non-English backgrounds. Legislation occur- 
ing at the time was favorable toward linguistic diversity. 
Sociolinguistics developed as the nod was given to re- 
searching "standard" as well as "non-standard" lan- 
guages in order to interpret their cultural significance 
The results of this research were ultimately used to pro- 
vide insight and mforrnation to educators of minority 
and non-mmority students. 

Sociolinguistics. according to Dr Ornstem-Galacia. has 
contributed significantly to language assessment. As a 
field of study, sociolinguistics has softened the outra- 
geous demands of some English teachers. These teach- 
ers were guilty of imposing elite standards of language 
regardless of the cultural or contextual milieu. Aware- 
ness of language and its various roles have resulted 
from sociolinguistics. Accordingly, in terms of assess- 
ment, lesser ability is not equated with nonstandard 
speech or language patterns. Sociolinguistics has made 
teachers and administrators aware of bilingual educa- 
tion as a meaningful, effective, instructional vehicle. 

As a result of the influence of sociolinguistics. options 
are available to producers and consumers of language. 
There is not one unitary, monolithic language. Sociol- 
inguistics provides a means to look at other disciplines 
and allows for a point of view that legitimizes those op- 
tions. Unfortunately, a problem exists when language 
assessment utilizes a sociolinguistic perspective. Be- 
cause no two people agree on what should be assessed 
sociolinguistically. the result Is global rather than specif- 
ic instrumentation. Practicality demands that much of 
the variation in the population be Ignored. As such, 
tests cannot be profitably made and marketed that take 
into consideration the experiences of all sub-groups. 
However based on the observations of sociohnguists. 
the results of currently existing instruments should be 
. carefully considered in light of the linguistic and cultural 
experiences of the examinee. 
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LANGUAGE PROFICIENCY ASSESSMENT: 
RESEARCH FINDINGS AND THEIR APPLICATION* 



Chariene Rivera 
Inter America 
Research Associates 
Rosslyn, Virginia 



Development of language proficiency assessment 
instruments has been greatly mfluenced by the various 
interpretations of linguistic theories and their applica- 
tion. Assessment measures developed in the 1950's 
and 1 960's were greatly influenced by structural linguis- 
tic views of language and psychometric methodology 
which prompted the testing of discrete aspects of lan- 
guage. These instruments were generally intended for 
adults learning a foreign language and, as such, as- 
sumed literacy m the native language. 

During the 1970's. '-the influence of psycholinguistics 
was reflected in the attempt to incorporate generative 
linguistic theory into language proficiency testing. The 
concept that "the reliable variance in a great variety of 
educational and psychological tests can be attributed 
to a single global factor of language proficiency" (Oiler. 
1979, p. 61) motivated the development of tests which 
"must invoke the expectancy system or grammar of the 
examinee* (p. 16). Oiler posits that tests which reflect 
the pragmatic perspective such as dictation, cloze and 
their variations "probably provide more accurate infor- 
mation concerning language proficiency. . .than the 
more familiar tests produced on the basis of discrete 
point theory ' (1 979, p. 9). 

The sociolinguistic/ethnographic perspective entailing 
"the notion that children's school language should be 
viewed within a broader framework of culturally ac- 
quired communicative competence" (Philips, 1980, pp. 
2-3) has recently begun to influence language proficien- 
cy assessment practices. Methodologicaliy this ap- 
proach implies the focused observation of students* lan- 
guage use m naturalistic settings In contrast to the 
traditional approaches to testing, it does not generally 
rely on the paper and pencil type of tests which can be 
statistically analyzed 

While traditional psychometric approaches to testing 
are generally used for purposes of identification and 
placement, they are recognized to be inadequate Thus, 
the controversy remains as to the nature of language 
and how to best measure it Issues which have not been 
adequately addressed by traditional testing procedures 
include such basic questions as: 
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• What does it mean to be proficient? 

Does it mean a person's receptive/expressive 
knowledge of discrete grammatical compo- 
nents of a language? 

Does it mean a person's knowledge of linguistic 
code(s) and its (their) appropriate use in dif- 
ferent social contexts? (Note 1) 

What are the variables which influence language 
use during communicative interactions? 

• How should these variables be incorporated into 
the development of language proficiency 
measures? 

Current research which has implications for language 
proficiency assessment practices includes research in 
adult language proficiency testing, cognitive studies 
which attempt to conceptualize the construct of lan- 
guage proficiency, development of theoretical models 
of communicative competence, studies investigating 
the validity of measures of communicative competence, 
and ethnographic/sociolinguistic studies of children's 
functional use of language. Representative research in 
each area will be reviewed. 

Research efforts in adult language proficiency testing 
are represented by the work of the Foreign Service Insti- 
tute (FSI) and the Educational Testing Service (ETS). 
These efforts are discussed because of their potential 
application to the assessment of language minority 
students. 

The Foreign Service Institute (FSI) has played a signifi- 
cant role in the development of an oral interview testing 
system to assess foreign language proficiency The 
system has evolved over the past thirty years. The sig- 
nificance of this effort lies in the standardization of oral 
testing procedures, the aspects of performance that are 
to be observed, and their rating. While the FSI Oral Inter- 
view Test represents a positive effort in the measure- 
ment of oral language proficiency of adult foreign lan- 
guage learners, the system; is limited in its ability to 
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measure the effectiveness of the communication pro- 
cess within cultural contexts (Rice. 1959). Currently ef- 
forts are being made to emphasize "more realistic uses 
of language" (Jones. Note 2). with a greater stress being 
placed on the comprehensive ability of individuals 
being tested. The goal of the FSI Oral Interview Test is 
to predict governme/t employees' successful use of the 
target language in,6verseas assignments. Although de- 
signed to assess adults' foreign language proficiency, it 
has the potential^o be adapted to assess the language 
proficiency of school age LEP students. ; 

Clark (1980)/ proposes the development of a "common 
measure" of speaking proficiency in second and foreign 
languages/for use with high-school and adult learners. 
He defines "common measure" as .a uniform testing 
procedure that can be used with diverse groups of 
examinees in a variety of language learning situations 
with testing results reported on a single uniform scale" 
(p 15) He argues that there is a need for development 
of a measure of speaking proficiency that can be validly 
used in a series of different situations for which develop- 
ment of specific procedures is not realistic. In an update 
on the development of such a measure. Clark (1981) 
emphasizes its benefits and potential use: 

• to further the development of a more sophisticiated 
measurement procedure; 

• to make available a cost and time effective instru- 
ment which evaluates communicative proficiency 

. in different situations and for different second lan- 
guage learners; 

• to provide a highly valid and reliable instrument for 
use m the validation of other tests of speaking 
proficiency 

Cummms suggests that the present state of the art in 
languaj^e proficiency assessment is confused because 
of the "tailure to develop an adequate theoretical frame- 
work for relating language proficiency to academic 
achievement" (1981). He contends that:' 

there has been a failure to adequately conceptualize 
the construct of language proficiency and its cross- 
Imgual dimensions. In other words, there has been 
relatively little inquiry Into what forms of language 
presidency are related to the development of litera- 
cy skills in school contexts, and how the develop- 
ment of literate proficiency in relates to the devel- 
opment of literate proficiency in L ^. 

He posits that there are two dimensions to language, 
proficiency cognitive-academic language proficiency 
(CALP) and basic interpersonal communication skill 
(BICS) CALP refers 'to the dimension of language profi- 
ciency that IS strongly related to literacy skills. BICS 
refers to cogmtively undemanding manifestations of 
language in interpresonal situations" (Cummms. 1980. 
p 28) He hypothesizes that these two dimensions of 
language can be empincally distinguished and that the 
native (L ) and second language (I) CALP-like skills are 
manifestations of the same underlying dimension 
Based on these hypotheses, which are currently bewip 
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investigated (Cummins, et al. Note 2). Cummins sug- 
gests that "placement of bilingual children in different 
types of instructional programs should not be based 
only on 'natural communication' (BICS) tasks (but that) 
developmental levels of L and L CALP should also be 
taken into account" (198b. p. 54). Thus, he strongly 
recommends that Students' literacy skills be tested in 
both L and L before placement/exit decisions are 
made. 

Sociolinguistic theory and research in teachmg second 
language learners has brought the concept of commu- 
nicative competence (Hymes. 1974) into prominence. 
While many interpretations have been given to this con- 
cept, in general, it is used to refer to mastery of commu- 
nicative skills acquired by second language learners 
and the appropriate use of these skills during social in- 
teractions. Two models based on this concept are pre- 
sented here because they provide a frame of reference 
for ongoing developments in language proficiency 
assessment. 

Canale and Swain. (1979) suggest a model which is 
based on the identification of features considered im- 
portant for communicating. These communication fea- 
tures are characterized as being interaction-based, un- 
predictable, creative and purposive. Communication is 
authentic, rather than contrived, and takes place within 
sociolinguistic and discourse contexts. Successful com- 
munication is judged on the basis of actual outcome. 
The three components of the communicative compe- 
tence model are: linguistic, sociolinguistic. and strategic 
competencies. Linguistic competence deals with mas- 
tery of the grammar of a language. Sociolinguistic 
competence involves mastery of appropriate language 
use with an emphasis on meaning and the appropriate- 
ness of the linguistic forms used to convey meaning. 
Strategic competence refers to second language lear- 
ners' ability to compensate, repair and use other strate- 
gies in their attempts to communicate with other 
participants. 

Research by Slaughter and Bennett (1981) into the 
nature of discourse of bilingual children expands on 
Swain and Canale's discourse component. Bennett 
(1981) describes discourse as being temporal, reflexive, 
multivocal and multimodal. It is temporal in that it 
evolves through time. It is reflexive in that language cre- 
ates the context for it to be understood. It is multivocal 
m that different meanings and interpretations are 
always available to participants, and it is multimodal in 
that it involves the selection of varied modalities such 
as choices of grammar, lexicon and nonverbal behaviors. 

Bnere (1979) developed a model which recognizes soci- 
olinguistic and linguistic competence as components of 
communicative proficiency These competencies have 
two dimensions: one at the abstract level, or 
linguistic/soclollnguistic competence, and the other at 
the performance level, or hnguistlc/sociolinguistic 
performance At the performance level, communicative 
proficiency is associated with the speaker s use of the 
grammar of the language in appropnate social, 
interactions 

i 
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The relevance of both Canale and Swain's and Briere's 
attempts to develop models of communicative profi- 
etency \s that they provide researchers and practitioners 
witfi a comprehensive framework The models suggest 
that language proficiency tests should consider knowl- 
edge of the linguistic code concurrently with appropri- 
ateness of its use during social interactions. 

Bachman and Palmer (1979.1981) investigated the con- 
struct validation of language tests m a two-phase study. 
They explored the validity of a simple model of language 
which consisted of two traits — speaking and reading. 
They found that the model statistically explained test re- 
sults better than a single factor model (Oiler, 1979). En- 
couraged by the findings, they expanded their research 
to investigate the construct validity of several other 
models of language. Using confirmatory factor analysis, 
they found that the model which showed the highest 
degree of statistical significance was a model that posit- 
ed one single factor and three specific trait factors: 
linguistic, sociolinguistic and pragmatic. The model 
was developed based on Canale and Swain's (1979) 
model of communicative competence. 

Ethnographic and sociolinguistic studies of children in a 
New York Puerto Rican community (Poplack. 1981) add 
to our understanding of the evolution and maintenance 
of language in stable bilingual speech communities. 
The role of code-switching of bilingual adults and child- 
ren investigated from an intergenerational perspective 
gives insights into the acquisition and use of code- 
switching and Its relationship to language proficiency. 

Rodriguez-Brown and Elias-Olivares (1981), in inves- 
tigating the communicative competence of bilingual 
children, concentrated on the expressive strategies 
used to make inquiries. Their findings indicate the im- 
portance of focusing on bilingual children's language 
use inexpenmental test situations. 

Simich and Rivera's work in cooperation with teachers 
and specialists from Tucson Unified School District. 
Tucson. Arizona, represents a preliminary effort to 
identify functional uses of language in elementary 
bilingual and monolingual classrooms. This work at- 
tempts to clanfy the reasons why children and other 
participants communicate and how children's knowl- 
edge of acceptable functional uses of language relates 
to language proficiency 

Sociolinguistic research highlights the difficulty of 
measuring language via discrete quantifiable means 
Shuy points out that the most critical aspects of lan- 
guage "are the ones least susceptible to quantification" 
(1977. p 77-78) In general, instruments of language 
proficiency measure easily quantifiable components of 
language such as pronunciation, vocabulary, and gram- 
mar, rather than those less visibl^ or quantifiable, such 
as meaning or the functional intent of utterances. Use of 
discrete point language assessment instruments does 
not provide val^id and accurate information about how 
effectively students participate in instructional settings 
Thus, the soQf)Unguistic approach promotes eval 3tion 
^ of the apprypV/iite use of language in different commu- 
© .iicative .s/Ujations Its contribution to the development 



of language proficiency assessment instruments is 
based on a broader interpretation of language which in- 
cludes the use of linguistic code(s) by participants in 
ways which are acceptable lo other members of a 
speech community. 

Studies in language variation have expanded the idea of 
the linguistic repertoire of speech communities (Gum- 
perz. 1972). Applied to language proficiency assess- 
ment, this concept supports the view that minority stu- 
dents' language proficiency should not be measured 
against the "standard" dialect of a language. Rather, a 
student's way of speaking should be considered ade- 
quate and appropriate in terms of the purposes it serves 
during communication. For example, in the southwest, 
where large numbers of Hispanic students live, code- 
switching incidences are common. This manner of com- 
munication should be considered appropriate given its 
functional use within the community. 

If we take the position that students' language should 
not be measured against a standard dialect, then we 
need to ask: why measure language proficiency? 

The measurement of language proficiency is necessary 
to provide teachers with an understanding of the lan- 
guage skills students have already acquired in the home 
environment. A comparison between home language 
skills and functional language demands in the 
classroom setting will provide necessary information 
upon which to place students in appropriate educational 
programs. 

More recent sociolinguistic research relates functional 
language use to language proficiency. Shuy defines 
functional language use as "the underlying knowledge 
that allows people to make utterances in order to ac- 
complish goals and to understand the utterances of 
others in terms of their goals. It includes a knowledge of 
what kinds of goals language can accomplish (the func- 
tions of language) and of what are permissible utter- 
ances to accomplish each function (language strate- 
gies)" (1977. p. 79). Several researchers in the last 
decade have investigated functional language use in 
school settings. The focus of their research has been on 
specific aspects of why and how children use language 
in different social contexts (e.g.. Cahir. 1978: Cazden. 
1979: Simich. 1980: Jacob. 1981). One outcome of the 
research on functional language use in the classroom 
has been the development of several sociolinguistically 
based language assessment instruments. 

For example. Shuy. McCreedy and Adger (1979) devel- 
oped an oral language assessnr^ent instrument for use 
with elementary schOQj childreh who are speakers of 
vernacular Black English (VBE). ^he instrument consists 
of three components. The first provides for approximate 
measurement of phonological and morphosyntactic fea- 
tures. The second component evaluates communicative 
competence according to relative appropriateness and 
strategies children use for /conversational functions 
such as explaining, describing, etc The third component 
evaluates discourse abilities, feuch as appropriateness of 
interrupting, use of transitional markers, referencing 
and style shifting 



Another effort toward the development of sociolinguisti- 
c/ethnographic measures is .represented in the work of 
Simichand Rivera (1981) in cooperation with Sunnyside 
and Tucson Unified School Districts (TUSD). in Arizona. 
An instrument, entitled the Teacher Observation 
System (TOSl was developed during a comprehensive 
two-year teacher training program. The goal of the pro- 
gram was to provide bilingual educators with back- 
ground in linguistics, sociolinguistics. ethnographic 
methodologies, measurement, and research methodolo- 
gies In order to enable them to develop accurate and ef- 



fective language proficiency assessment strategies. The 
TOS was developed based on the understanding that 
the range of students' language repertoire can only be 
determined through systematic and focused observa- 
tions of interactions in a variety of school settings. A 
framework to icj^ntify the variables that influence stu- 
dents' communicative interactions was developed, 
based on certain components of speech events 
(Gumperz & Hymes. 1972). This framework is described 
in Table I. 



TABLE I 

Components pf Instructional Events to Be Considered 
in the Development of the Teacher Observation System 



Setting 



Particpants 



Channels 
of Communication 



Language 
Used 



Discourse 
Characteristics 



Instructional 
(formal) 
vs. 

Non-instructional 
(informal) events 



Teacher/Student(s) 
Student/Student(s) 



speaking 
listening 
reading 
writing 



English 
Spanish 



coherence 
complexity 

adequacy of vocabulary 
code-switching 



The following questions helped participants focus on an 
ethnogr-'phic/sociolinguistic perspective during the de- 
velopment of the TOS. 

• What kind of communicative skills do bilingual 
students need to master in order to participate 
appropriately as members of the sociocultural 
school environment? 

• In which sociocultural situation can these com- 
municative skills be observed? 

• What kinds of communic&tive skills do students 
bring to school? 

• In which language(s) and sociocultural situa- 
tions do students have the widest contextual 
range of communicative abilities? 

The questions prompted a discussion of factors that in- 
fluence students' communicative interactions. The 
components considered were: classroom organization 
(teacher-centered vs. student-centered), language of in- 
struction, directness dr indirectness of "teacher-talk" 
and students' language use and their sociocultural back- 
ground; parents' socioeconomic and educational back- 
ground, number of siblings, and language use at home, 
school and community. 

Through the process of developing the TOS, teachers 
became aware of the sociocultural aspects of language 
as they acquired more sensitivity and understanding of 
language use m multicultural/multilingual school set- 
tings. This knowledge, they confirmed, assisted them in 
the classroom situation to make better judgments about 
their students' communicative proficiency. 



Slaughter and Bennett (1981) are attempting to develop 
a unified framework for the analysis of discourse sam- 
ples elicited from Spanish/English bilingual students. 
The framework will be used in the validation of the Lan- 
guage Proficiency Measure (LPM). The instrument de- 
veloped by the TUSD purports to measure the language 
proficiency of K-12 English/Spanish bilingual students 
based on samples of discourse elicited in an experimen- 
tal setting. 

In summary, theoretical and applied research from the 
psycholinguistic and sociolinguistic perspective is 
providing basic information that is proving useful in the 
further exploration of language proficiency assessment 
practices. The research cited is intended to be repre- 
sentative of both traditional and nontraditional ap- 
proaches to interpretations of the nature of language, 
language proficiency and language proficiency 
assessment. 



Reference Notes 

Social context refers to cultural/physical contexts 
in which communicative events take place. The 
social context is an important variable which in- 
fluences communicative outcomes. In school set- 
tings, both academic and nonacademic events are 
common. Examples of social contexts range from 
math cj^ss to recess. 

Jones. D. Personal Communication Foreign Service 
Institute. Rosslyn. VA. June 17. 1981 
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During the past few years we have been giving some 
thought to the feasibiity of developing assessment 
instruments which ar^consistent with a communicative 
approach to second tanguage instruction. In doing this. 
It has been essential toNjome to grips with what is 
meant by a comnpiifnicative approach. In general, a com- 
municative approach tmplies a fundamental shift of 
emphasi^ dWay from language teaching through form 
to language teaching through meaning. The danger in 
this shift is that little place may be left for the teaching 
of form, that is. the grammatical aspects of the lan- 
guage. Morrow (1977) has commented in relation to 
communicative approaches to second language teach- 
ing and testing that "the polarization >A/hich can be 
detected in some quarters between a concentration on 
form and a concentration on meaning must be resisted" 
Ip. 1 1 ). It is certainly the case that an examination of cur- 
rent literature on communicative competence and per- 
formance, of pedagogical materials and teaching tech- 
niques which claim to be directed towards the develop- 
ment of communicative skills, and of testing instru- 
ments which claim to be measuring the knowledge and 
use of those skills suggests that there is less than termi- 
nological consensus about what is meant by a commu- 
nicative approach. The importance, of reaching a con- 
sensus must not be underrated, both for purposes of 
program design and program evaluation. Without it. for 
example, competing claims for the effectiveness of dif- 
ferent instructional programs in developing communica- 
tive abilities cannot be assessed (Canale and Swain. 
1979). 

Our own consideration of this matter has been signifi- 
cantly influenced by the target population with whom 
the assessment instruments will be used. This popula- 
tion consists of English-speaking students in Ontario at 
the grades six and ten levels who will have had a maxi- 
mum number of accumulated hours of French as a 
second language (FSL) instruction of 540 and 720^ at 
the end of their respective school years. Thus our target 
population consists of school-aged children who are at 
a relatively early stage of second language learning. It is 
our opinion that a de-emphasis on the importance of 
grammar in some of the current discussions about com- 
municative approaches to second language instruction 
derives from the fact that the target populations under 
consideration differ from the ones we are concerned 
with m that they have already achieved advanced levels 
of grammatical competence in the second language. 



InJHjr attempts to outline the essential elements of a 
communicative approach for general programs of 
second language instruction, we initially focussed on 
the literature related to communicative competence.^ 
The purpose of this paper is to discuss several different 
notions of what is meant by communicative compe- 
tence, arguing ultimately for a framework which in- 
cludes three components: grammatical competence, 
sociolinguistic competence, and strategic competence.^ 
Several issues raised by this proposed framework for 
second language teaching and testing will then be dis* 
cussed. For a full discussion of these and related issues, 
see Canaleand Swain (1980). ^ 

It is important at the outset to note that by being con- 
cerned with communicative competence, we are not 
assuming that it is the highest or broadest level of lan- 
guage competence that can be distinguished or that is 
relevant for second language teaching purposes. The as- 
sumption that communication is the essential purpose 
of language is widespread (see. for example. Campbell 
and Wales. 1970; Groot. 1975: Munby 1978) and 
would seem to imply that communicative competence 
is the most inclusive language competence. However, 
as has been pointed out (see. for example. Chomsky, 
1 975; Fraser, 1 974; Halliday 1 978). there is little reason 
to view exrema//y oriented communication as more es- 
sential than other purposes of language such as self- 
expression, problem solving and creative writing. Non- 
etheless, the communicative purpose would seem to be 
the most practical concern for a general second lan- 
guage program. For a bilingual education program, how- 
ever, focussing only on the communicative function of 
language would appear to be too limited. 

Another general point needs to be made at the outset 
concerning the distinction between competence as 
knowledge of a system of rules and performance as the 
actual use of the rules in communicative acts. We have 
maintained this distinction despite the obvious state- 
ment that all observed behaviour is performance. The 
distinction is significant not only with respect to 
classroom activities, but to testing as well. For example, 
the distinction suggests that communicative testing 
must be devoted not only to what the learners know 
about the second language and about how to use it. but 
also to what extent the learner is able to actually 
demonstrate this knowledge in meaningful communica- 
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tive situations. The former suggests the. use of 
competence-oriented tasks with the possibility of 
discrete-point items for diagnostic purposes. The latter 
suggests the use of performance-oriented tasks which 
conform more directly to normal language use in which 
an integration of these skills is required with little time 
to reflect on and monitor language input and output (i.e. 
Clark's 'direct* tests of language proficiency). One would 
not Wdni to ignore performance tasks completely in a 
communicative testing program even if more 
competence-oriented tests that correlated highly with 
actual performance were developed. As John Clark 
11972) states: 

Indirect tests of proficiency do not provide an op- 
portunity for the student to try out his language 
competence in realistic communicative situations. 
Although they may correspond in a statistical sense 
to direct tests of proficiency, paper-and-pencil 
tests, tape-recorded listening and speaking tests, 
and similar measures cannot have the same psy- 
chological value for the student orthe same instruc- 
tional impact. For this reason alone, administration 
of a direct test of communicative proficiency at one 
or more points in the student's language-learning 
career would be a very worthwhile undertaking (p. 
132). 

This also corresponds to one of the essential elements 
of a communicative approach: "that the second lan- 
guage learner must have the opportunity to take part in 
meaningful communicative interaction with highly 
competent speakers of the language, i.e. to respond to 
genuine communicative needs in realistic second lan- 
guage situations'* (Canale and Swain, 1980). If the in- 
structional program includes this, then sc should the 
testing program. 

Let us turn now to a consideration of the literature on 
communicative competence. It is common to find the 
term communicative competence used to refer exclu- 
sively \o knowledge of the rules of language use and the 
term grammatical (linguistic) competence used to refer 
to knowledge of the rules of sentence grammar The 
terms are used in this manner by. for example, Allen 
(1978), Jakobovits (1970), Palmer (1978). Paulston 
(1974) and Widdowson (1971). It is equally common to 
find the term communicative competence used, as pro- 
posed by Campbell and Wales (1970) and Hymes 
(1972). to include not only grammatical competence, 
but also contextual or sociolinguistic competence, that 
is. knowledge of the rules of language use. Cooper 
(1968). Morrow (1977), Munby (1978) and Savignon 
(1972). among others, use the term communicative 
competence in this way. 

Munby argues that the view that communicative 
Competence includes grammatical competence is to be 
preferred to the view that it does not because it avoids 
the misleading conclusion that grammatical compe- 
tence IS not ess,ential to communicative competence. 
Hymes (1972) has pointed out that there are rules of 
grammar that would be useless without rules of lan- 
guage use. Similarly, we have claimed (Canale and 
Swdin. 1980) that there are rules of language use that 
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would be useless without rules of. grammar. For exam- 
ple, one may have an adequate level of sociolinguistic 
competence in Canadian French just from having devel- 
oped such a competence in Canadian English, but with- 
out some minimal level of grammatical competence in 
French, it is unlikely that one could communicate effec- 
tively with a monolingual speaker of Canadian French 
(cf. also Morrow, 1977 on this point). Thus it seems es- 
sential to incorporate into any notion of communicative 
competence both grammatical and sociolinguistic 
knowledge. 

This position does not, however, include another com- 
monly held view about what constitutes communicative 
competence — the ability to get one's meaning across 
regardless of the appropriateness or grammaticalness 
of the utterance.^ There would seem to be an underlying 
assumption in some communicative approaches to 
second language instruction that more effective sacond 
language learning takes place if emphasis is put from 
the beginning on getting one's meaning across, rather 
than on the grammaticalness or social appropriateness 
of one's utterance. 

The view that meaning sh jld be emphasized over 
grammaticalness has three bases, each of which 
deserves to be examined when the instructional pro- 
gram is intended for adolescent learners. One basis is 
the observation that children in acquiring a first lan- 
guage seem to focus more on being understood than on 
speaking grammatically, and. therefore, second lan- 
guage acquisition might be permitted to proceed in this 
way as well. Secondly, there Is the assumption that des- 
pite initial ungrammaticalness on the part of the child 
learner with respect to the first language, the first lan- 
guage is always ever jally acquired. In other words, it 
is assumed that an mitial emphasis on getting one's 
meaning across does not preclude the attainment of full 
grammatical competence at a later stage. The third 
basis for suggesting that second language acquisition 
might be allowed to proceed like first language acquisi- 
tion is the belief that language learning is more effective 
when the learner is involved in real communicative 
acts, rather than rehearsing grammatical forms. 

Let us consider each of these points in turn. Th^ sugges- 
tion that second language acquisition might be allowed 
to proceed like first language acquisition derives in part 
from evidence which suggests that the processes of 
first language acquisition are similar to those of second 
language acquisition. The evidence comes mainly from 
studies which have demonstrated that many errors 
made by first and second language learners are similar 
(e.g. Dulay and Burt, 1974; Richards, 1972). However, 
considerable evidence has also been accumulated to 
suggest that many errors are different due at least in 
part to language transfer (e.g. Canale. Mougeon and 
Beniak. 1978; Harley and Swain. 1978; Schachter and 
Rutherford, 1979). These .errors may lead to grammatical 
forms that are difficult to understand or that may be 
misinterpreted by native unilingual speakers. Addition- 
ally, neurological (Lenneberg. 1967; Scovel, 1978). 
cognitive (Rosansky, 1975). affective (Schumann, 1975). 
and contextual (Stern, 1976) factors have been cited as 



grounds for differences between children, learning a 
first or second language, and adolescent and adult 
second language learners. For example, it has been 
argued that the ons^t of formal operations changes the 
way in which language data are processed and stored 
(Rosanky. 1 975) and that the presentation of material to 
adolescent or adult learne»'S should therefore be modi- 
fied to take account of these learners' dominant pro- 
cessing strategies. 

The second assumption — that despite initial ungram- 
maticalness on the part of the child first language lear- 
ner the target language is always eventually acquired 
— is clearly untenable where adolescent or adult 
second language learners are concerned. There are 
many studies which report on the interlanguage of 
aduit second language learners, and the fossilization of 
errors (see. for example. Canale. Mougeon. Belanger and 
Ituen. 1977: Richards. 1975:^ Selinker. 1969). It is also 
the case that fossilization may occur with child second 
language learners as well. 

Consider the case of children learning a second lan- 
guage in which initial emphasis has been placed on get- 
ting across one's meaning. The example to be discussed 
is that of young children in French immersion programs 
in Canada. In these programs. English-speaking children 
who are initially uniiingual receive instruction for the 
first several years of schooling entirely in French. They 
are young, and it has been thought inappropriate to 
teach grammar in any formal way. The emphasis is on 
having the children understand what the teacher has to 
say through instructional techniques which make con- 
siderable use of concrete situations, visual aides, and 
non-verbal cues. The children initially speak in English 
to their teacher, who is bilingual and can therefore un- 
derstand the children and respond appropriately in 
French. The children are encouraged to use what 
French they have acquired through their exposure to 
French" as well as through the active instruction of 
vocabulary. In effect, the children are encouraged to use 
their French long before they have the grammatical 
knowledge to express their ideas. They resort to numer- 
ous communicative strategies (see Tarone. 1978). For 
example, -they use English words in otherwise French 
utterances, they use English structures, and they use ad- 
verbs of time to express tense. There is initially minimal 
explicit correction by the teacher partly in order not to 
discourage the children from using French by constantly 
interrupting and correcting them, and partly because 
the' curriculum emphasis is on getting one's ideas 
across. At later grade levels, there is more explicit cor- 
rection, both of oral and written expression. There is 
also some teaching of grammar, although we do not 
know the extent to which this occurs, or the form it 
takes. It would appear to vary considerably from teacher 
to teacher, and from school board to school board.^ 

What are the results related to these students' second 
linguage learning? The details of the answer depend 
partly'on what grade level the children were in when 
the data were collected, how the data were collected. 
- and which of the skills — speaking, understanding, read- 
ing or writing — were being examined. We do not 
intend to report here oh all the details They are reported 
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elsewhere (see. for example. Andrew. Lapkin and 
Swain. 1979: Barik and Swain. 1975: 1976: Harley and 
Swain. 1^77: 1978: Lambert and Tucker. 1972: Lapkin 
ad Swain, 1977: Selinker. Swain and Dumas. 1975; 
Spilka. 1976: Swain. 1975: 1978: 1979: S^arrosi. 
Swain and Lapkin. in press). Furthermore the research 
associated with getting one's meaning across has 
focussed mainly on oral skills, rather than on compre- 
hension, and it is on this former aspect that we want to 
focus now. It is important to note in the following dis- 
cussion, however, that the immersion children showed 
no sign of not understanding what was said to them. 
Their ability to understand was native-like. 

The results indicate that although there is gradual "im- 
provement over time, the children's speech contains 
numerous grammatical errors. For example, in a descrip- 
tive study of the verb system of fifth-grade immersion 
students (Harley and Swain. 1978). it was noted that the 
French verb system of these children is a simplified one: 
it does not cover the complexities of. for example, the 
conditional tense, and morphologically irregular verbs 
are made regular, most verbs are being treated as 
though they belonged to the simplest, most regular "er" 
conjugation (i.e. verbs such as parler "to speak"). A com- 
parison group of francophone children was found to 
have made very few errors of this kind. The study did ^ 
find, however, that the range of meanings that the im- 
mersion students were able to express in the interviews 
was comparable to that of the francophone students. 

In another study with grade two immersion atodents. 
we arranged for several immersion students each to 
spend a couple of hours each week .playing together 
with a francophone child over a period of two months. 
Their spontaneous interactions during these play ses- 
sions were recorded. The tape recordings tell us two 
things. First, the immersion children made numerous 

I grammatical errors. Secondly, in spite of this, they clear- 
ly had "co^^municative confidence" — they did not hesi- 

< tate to engage in communicative activities in French. 
The childre/^ argued, persuaded, agreed, contradicted. 

I played with\ words, interrupted, changed topics, ex- 

1 changed information, made jokes, etc. (for an example 
see Szamosi. Swain and Lapkin. in press). 

,The conclusion to be drawn from these studies is that 
;early emphasis on getting one's meaning across results 

iin the ability to do so. but it does not necessarily imply 

that grammatical accuracy is the result. 

To be sure, if the immersion children were placed in an 
Environment where only uniiingual francophdnes were 
present, they might develop this grammatical compe- 
tence. That is to say. their current communicative needs 
in interacting with individuals whose own competence 
includes knowledge of English grammar are satisfied, 
arid further development is not necessary in their partic- 
ulpr context. Perhaps the expression of more sophisti- 
calted ideas, out of the here-and-now context, with uni- 
lirlgual speakers would lead to the learning of the gram- 
matical means essential to express them. 8ut perhaps 
not. Perhaps they would continue to rely on commu- 
nicative strategies that result in getting across their in- 
• i 



tended meaning, but in a non-native-like way.® Knowl- 
edge abput. and use of, communicative strategies (i.e. 
strategic competence) is. in our opinion, an important 
component of communicative competence, but not the 
sole component. 

The third basis for the suggestion that second language 
acquisition should be allowed to proceed like first lan- 
guage acquisition is" the belief that language learning is 
more effective when the learner is involved in real com- 
municative acts, rather than rehearsing grannmatical 
fQrms. The previous example concerning immersion 
children speaks partly to this issue in that they were not 
involved in rehearsing grammatical forms. If "effectivirV 
language learning is to include the concept of gram- 
matical accuracy, then it is necessary to conclude that 
the program has not been fully effective. 

There is no similar study, that we are aware of, related 
to adolescent or adult learners. However, there are 
several studies which do address the question of the 
effect on communicative abilities of second language 
programs which focus initially on teaching grammar 
alone or alternatively on teaching grammar and basic 
communication skills. Let us turn now to these studies 
to see what insights they can provide concerning the 
role of teaching grammar in developing communicative 
abilities. 

It is a commonly held view that knowledge of grammar 
alone does not necessarily lead to effective communica- 
tion. Tucker (1974). for example, found that students 
who scored high (95th percentile) in English language 
proficiency as demonstrated by the Michigan Test of En- 
glish Language Performance and the Test of English as 
a Foreign Language, and those who scored low ibelow 
the 60th percentile), scored equivalently on three of the 
four communicative tasks. The communicative tasks in- 
volved having the testee describe an object or picture to 
a listener on the other side of an opaque screen such 
that the listener could identify the object or picture from 
among an array of such items before him. It seems that 
an appropriate conclusion from this and other similar 
studies (for example. Upshur and Palmer. 1974) is that 
focus on grammatical competence in the classroom is 
not a sufficient condition for the development of com- 
municative competence. It would be inappropriate, 
however, to conclude from these studies that the devel- 
opment of grammatical competence is irrelevant to. or 
unnecessary for. the development of communicative 

\ . competence, given that all subjects in the studies did 

\^ have grammatical training. 

\ Savignon (1972) undertook a study to determine the 
\ effect of teaching both grammar and communicative 
\ skills. In her study, three groups of students were en- 
rolled in a basic introductory audio-lingual course of 
Wench at the university level. All three groups received 
tl^ie same number of hours of instruction in the standard 
(fcjrmal and grammatical) program, but one group had 
an\addttional hour per week devoted to communicative 
tasks in which the emphasis was mainly on getting 
one^s meaning across and on being able to cope in com- 
municative situations. The second group spent an addi- 
Q ♦lonal hour per week in a "culture lab" program, and the 
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third spent an additional hour per week in a language 
laboratory program. 

Savignon found that although there were no significant 
differences among groups on tests of grammatical 
competence, the communicatively trained group scored 
significantly higher than the other two groups oh four 
communicative tasks she developed. The first task was 
a discussion in French between a student and a native 
speaker of French on one of three topics: the second, an 
information-getting interview in which the student had 
to find out about the native speaker by asking him 
questions: the third, a reporting task in which the stu- 
dent had to discuss a given topic first in English, then in 
French; and the fourth, a descriptive task in which the 
student had to describe an ongoing activity. The criteria 
of evaluation for these tests included effort to 
communicate, amount of communication, semantic 
accuracy of information, comprehensibility and 
suitability, and naturalness and poise in keeping a 
verbal interaction in hand. Savignon claims that "the 
most significant findings of this study point to the value 
of training in communicative skills from the very begin- 
ning of the program" (1972. p. 9). 

Oiler and Obrecht (1968) report a similar conclusion. In 
this study, they found that the effectiveness of pattern 
drills is significantly increased when the language in the 
drill is related to communication. Their conclusion is 
that from the beginning of a second language program, 
aspects of grammatical competence should be taught 
in the context of meaningful communication. Thus, in 
sum. these studies suggest that some combination of 
emphasis on grammatical accuracy and emphasis on 
meaningful communication from the start of second lan- 
guage study is effective in developing both grammatical 
skills and communicative skills. 

With these considerations in mind, one of five principles 
which underlie what we consider to be essential to a 
communicative approach to second language teaching 
and testing was formulated. It states that: 

Communicative competence is composed minimal- 
ly of grammatical competence, sociolinguistic 
competence, and. . .strategic competence. There is 
no strong theoretical or empirical motivation for the 
view that grammatical competence is any more or 
less crucial to successful communication than is 
sociolinguistic competence or strategic compe- 
tence. The primary goal of a communicative ap- 
proach must be to facilitate the integration of these 
types of knowl$d§e-for theJearfier. an outcome that 
is not likely ta result from overemphasis on one 
form of competence over the others throughout a 
r>econd language program (Canale and Swain. 
1980). 

A brief outline of the boundaries and contents of the 
three components of communicative competence is 
therefore in order. A more complete description is found 
in Ganale and Swain (1980). Grammatical competence 
is understood to include knowledge of lexical items and 
o of rules of morphology, syntax, sentence-grammar 
semantics and phonology. Grammatical competence is 
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an important concern for any communicative approach 
whose goals include providing learners with the knowl- 
edge of how to determine and express accurately the lit- 
eral meaning of utterances. 

Sociolinguistic competence is understood to include 
knowledge of socfocu/tural rules of use and rules of dis- 
course. Knowledge of these rules will be crucial in inter- 
preting utterances for social meaning, particularly when 
there is a low level of transparency between the literal 
meaning of an utterance and the speaker's intention. 
The primary focus of soc/ocu/tura/ rules of use is on the 
extent to which certain propositions and communica- 
tive functions are appropriate within a given sociocuh 
tural context depending on contextual factors such as 
topic, role of participants, setting, and norms of interac- 
tion. The primary focus of discourse rules is on the cohe- 
s/venessof groups of utterances, that is. the grammati- 
cal links, and on the coherence of groups of utterances, 
that is. the appropriate combination of ideas and com- 
municative functions (see Halliday and Hasan, 1976, 
and Widdowson. 1 978, for discussion). - 

Strategic competence Is understood to include verbal 
and nonverbal communication strategies that may be 
called into action at times when the flow of speech 
might otherwise be impeded due to performance varia- 
bles or to insufficient competence (see also Tarone, 
1 978.) Such strategies are of two main types: those that 
relate primarily to grammatical competence, for exam- 
pie, how to paraphrase grammatical forms that one has 
.not mastered or cannot recall momentarily; and those 
that relate more to sociolinguistic competence, such as 
how to hold one's place in a conversation and how to 
address strangers when unsure of their social status. 

Adoption of this framework of communicative compe- 
tence raises a number of issues for second language 
pedagogy and testing. The discussion here is limited to 
those which are directly relevant to the argument that 
grammatical competence is to be considered an essen- 
tial component of communicative competence. 

It should be noted that it is possible to have a commu- 
nicative approach to second language instruction in 
which principles of organization are either grammatical- 
ly based or functionally based. In our opinion, even 
though grammatical competence is considered to be an 
essential component of communicative competence, a 
program organized on the basis of communicative func- 
tions is to be preferred to a program organized on a 
grammatical basis. Morrow (1977) and Johnson (1977, 
1 978) have expressed a concern that a second language 
syllabus organized on the basis of communicative func- 
tions may be disorganized with respect to grammar 
Thus Johnson (1977) writes: 

It seems reasonable to expect sentence$ which 
form a homogeneous functional grouping to be 
grammatically unlike. The choice of a functional or- 
ganization therefore seems to imply a degree of 
structural 'disorganization/ to the extent that many 
structurally dissimilar sentences may be presented 
in the same unit, while what may be taken to be key 
examples of particular grammatical structures will 
Q be scattered throughout the course (p 669), 
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Furthermore, it is possible that this disorganization may 
impede second language learning more at the initial 
stages of study than at later stages (Canale and Swain. 
1980). However, this is an open question; there are no 
empirical data on the relative effectiveness of grammati- 
cally organized syllabuses versus functionally organized 
syllabuses at any stage of learning, nor will there be any 
satisfactory answers until tests which incorporate the 
essential components of communicative competence 
are developed. Obviously, testing mstruments which 
only measure one component of communicative compe- 
tence will give a distorted picture of what the second 
language program is able to pchieve. Furthermore, if dif- 
ferent evaluations assess programs by focusing on dif- 
ferent components, no comparisons across programs 
can validly be made. 

In spite of Morrow and Johnson's concern, it is by no 
means an established fact that a functionally organized 
approach cannot achieve a level of grammatical organi- 
zation that is adequate for second language teaching 
and learning. As Canale and Swain (1980) suggest, 
there may be mearis of introducing an adequate level of 
grammatical sequencing into a functionally organized 
approach by: \ 

1. making use of Igrammatical sequencing criteria — 
e.g. degree of complexity, generalizability and trans- 
parency with respect to functions, and acceptability 
in terms of perceptual strategies — in selecting the 
grammatical forms to be introduced in covering a 
given function; 

2. treating such grammatical sequencing criteria as 
an essential subset of the set of criteria used to 
determine functional sequencing; 

3. making use of repetitions of grammatical forms in 
different functions throughout the syllabus, assum- 
ing that such forms are partially specified on the 
basis of their generalizability; and 

4. devoting a certain proportion of classroom time and 
textbook coverage to discussion of and/or practice 
on new^or especially difficult grammatical points 
and on interrelationships among various points as 
implemented, for example, in the course materials 
prepared by Johnson and Morrow (1978). Other 
possibilities will no doubt suggest themselves in 
the course of research on achieving the optimal bal- 
ance between functional and grammatical organiza- 
tion at a given stage of study. 

Another reason for proposing a communicative ap- 
proach organized on a functional basis has to do with 
its face validity. It is our view that a functionally based 
communicative approach — in particular, one in which 
units are organized and labelled with reference to com- 
municative functions — is more likely to have positive 
consequences for learner motivation than is a grammati- 
cally based communicative approach — in particular, 
one in which units are organized and labelled with refer- 
ence to grammatical forms. This view needs, however, 
to be empirically tested. 




Concerning the development oi assessment instru- 
ments, it has already been pointed out that the distinc- 
tion between competence and performance suggests 
the possibility of discrete-pomt items to measure the 
components of communiccitive competence. It must be 
noted, however, that in order to be consistent with a 
communicative approach, the point being tested must 
be presented in a meaningful context that is relevant to 
the learner's needs. Morrow (1977. p. 28) has suggested 
that discrete-point tests may be expected to address 
the learner's competence (sociolinguistic competence 
in our framework) in assessing a communicative interac- 
tion in the following terms: 

1 . the settings to which it might be appropriate. 

2. the topic which is being presented. 

3. the function of the utterance. 

4. the modality or attitude adopted by the 
speaker/writer. 

5. the pre-suppositions behind the utterance. 

6. the role the speaker/writer is adopting. 

7. the status implicit in the utterance. 

8. the level of formality on which the speaker/writ- 
er is conducting the interaction. 

9 the nnood of the speaker/writer 

A fleshing out of the components within our framework 
should certainly lead to additions to this list. An obvious 
gap in the above list is arpy direct reference to grammati- 
cal competence or strategic competence. 

Based on both a theoretical argument as well as on 
limited empirical evidence, this paper has suggested 
that communicative competence should be considered 
to consist minimal'/ of three components — grammati- 
cal competence, sociolinguistic competence, and strate- 
gic competence. The empirical evidence suggests that 
grammatical competence is a necessary but not suffi- 
cient condition for e^/ecf/ve communication; that initial 
emphasis on strategic competence does not necessarily 
lead to full grammatical competence; and that initial 
emphasis on grammatical competence and strategic 
competence does not impede the acquisition of gram- 
matical competence, while it aids the acquisition of 
strategic competence. Although not discussed in the 
paper, it would also appear, if the immersion children 
can be taken as an example, that sociolinguistic compe- 
tence 'transfers quite naturally in many instances from 
first language competence. 

In addition, the paper has suggested that although 
grammatical competence, plays a significant role in 
communicative competence, a communicative ap- 
proach to second language teaching which is organized 
on a functional basis, rather than on a grammatical one. 
IS to be preferred. However, considerable research evi- 
dence IS needed to support this view. 

In fact, this paper points to the general paucity of re- 
search that has been undertaken with regard to the ef- 
fectiveness of vanous programs in tpaching and testing 
communicative competence and communicative perfor- 
mance It IS hoped that tho general framework provided 
here and expanded on in Canale and .^^'^'ain (1980) will 
*ead to a more comprehensive conception of the nature 



of communicative competence that will guide program 
designers and test developers, and against which 
claims for a communicative-based approach to second 
language teaching and testing can be evaluated. Clearly 
there remains an enormous amount of work to be done, 
from the fleshing out of the framework suggested here 
to the development and validation of test items, and to 
research, for example, on how various levels of commu- 
nicative competence by second language speakers of 
different ages are perceived by target language speak- 
ers, and thus what the relative importance is of the 
three components of cogimunicative competence at 
various stages of learning. Ultimately it may be research 
of this sort which will permit us to determine what the 
target levels of general second language programs 
should be, and therefore, what the optimal balance of in- 
structional emphasis on grammatical, sociolinguistic 
and strategic competence should be. 



Footnotes 

This work has been funded under contract by the Onta- 
rio Ministry of Education. We would like to thank the 
Ministry for its support. We would also like to thank 
Birgit Harley and Sharon Lapkin who provided insightful 
comments after reading an earlier draft of this paper, 
which was presented as a Forum Lecture at the TESOL 
Summer Institute. UCLA. July 1 979. 

1. A student who has had 720 accumulated hours of 
FSL instruction ^might have had. for example*. 40- 
minute classes over a period of 6 years. 

2. For an update of this theoretical framework and re- 
search bearing on it. see Canale (1981). 

3. Note that the emphasis here is only on oral produc- 
tive skills (speaking). 

4. Evidence which supports this suggestion comes 
from a Ph.D. thesis in progress in which it has been 
found that unilingual francophones judge the errors 
made by immersion students (age 1 1) to be less ac- 
ceptable and more irritating than do bilinguals 
(French-English) (Lepicq. forthcoming). 

5. This is a matter of considerable interest. It would be 
instructive in the debate of the role of teaching 
grammar per se to compare the spontaneous 
speech (and writing) of children who have had 
some form of formal instruction in grammar with 
those who have not. at various grade levels. One 
question we have, other than whether teaching 
grammar perse makes a (difference at this age level, 
is how early it can be introduced so as to make a dif- 
ference. Equally important is to determine whether 
one form of grammar teaching is better than anoth- 
er at the early elementary level. 

6. An important issue in the interpretation of the im- 
mersion results IS whether certain grammatical as- 
pects have not been acquired/learned because the 
sociocultural needs are not present, or because 
grammar has not explicitly been taught. Although 
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we believe that if the sociocultural needs were pre- 
sent in the /n7/77ed/afe environment through the pre- 
sence of native French-speaking peers, the French 
used by the Immersion children would be different; 
it is the case that the political climate and educa- 
tional systems in Ontario and Quebec make it diffi- 
cult to create such an environment in the 



classroom (s6e Swain. 1979 and Gene3ee. 1979 for 
a discussion of these issues). Given that this is the 
case, then we must turn to. and experiment with, 
other techniques Such as the explicit teaching of 
grammar. The extent to which it wotJid make a dif- 
ference is. as yet. an open question. 
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The reactor's role is a difficult one. He is called upon to 
critique work in which others have invested (nore time 
and effort than he ever will and to raise issues which 
otherwise might be overlooked. He must critique 
thoughtfully. De Avila. et al make the task easier since 
their work readily suggests a wealth of issues and ideas. 
I think that three reactions are appropriate to De Avila. 
et al.'s contribution to this monograph: delight an 
analytic attempt to understand the paper and projection 
vis-a-vis future work to build upon their work. I will ad- 
dress each reaction in turn: it is hoped, the manner in 
which the reactions are framed will fulfill my duties as a 
thoughtful critic. i 

DELIGHT: 

De A\/ifa should be congratulated for presenting Bilingu- 
al kjjucation (BE) beyond the linguistic format by which 
It is usually portrayed. By focusing upon the teaching 
and learning of content (science and mathematics) and 
by stressing the importance of higher order thinking 
skills, this paper draws attention to two major concerns 
that BE shares with mainstream education, De Avila's 
findings show that children in BE can rise to the chal- 
lenge of higher forms of thought in the classroom. His 
findings also challenge those in BE to incorporate the 
more general concerns of content and critical thinking 
into their own teaching 

The alternative approach — the stress of linguistic dif- 
ferences between populations — has had the effect of 
typing bilingual education to a conceptual and political 
straightjacket by casting it as being another form of 
compensatory education We find mainstream teachers 
who do not believe that limited-English proficient stu- 
dents can learn: we must contend with public (and ad- 
ministrative) attitudes and bases against language dif- 
ferences and/or the allocation of resources for com- 
pensatory purposes. Thus, the policy implications of De 
Avfla's approach, one which raises issues that lie at the 
heart of mainstream education, should not be 
underestimated 

For example. Diennes' pioneenng work demonstrated 
how children as young as fourth and fifth grades can 
engage in sophisticated forms of mathematical reason- 
ing (algebra, group theory) with the assistance of mam- 
pulatives (Diennes. 1960. 1963) Bell. Fuson & Lesh 
(1976) detail a vanety of ways by which to teach arith- 
metic and algebra through manipulatives and small 
Q "roup activities. For the mainstream, the issue is how to 
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make such skills and knowledge accessible to all 
students. For BE, the issue is how to adapt methods 
from the mainstream to serve the same ends. Regardless 
of the name, the goal is the same: to make accessible to 
children content-based knowledge and to challenge 
them to use higher forms of thought. De Avila's results 
show that it can be done. 

ANALYTIC ATTEMPT AT UNDERSTANDING: 

This study was intended to show that children in BE can 
learn science and mathematics when the subjects in- 
volve higher forms of reasoning than are commorlly 
stressed in the classroom. The first set of analysed 
demonstrated achievement growth by the MICA stu- 
dents beyond what they would be expected to show. A 
second set of analyses was used to determine the 
source of that growth — the regular BE program or the 
MICA program. De Avila found a similar pattern of over- 
all academic growth by both the MICA and BE (compari- 
son) groups. However, on each of the math concepts 
and the math applications subtests (each of which mea- 
sures higher forms of cognition), significant time by con- 
dition interactions (Table B) indicates that the MICA 
group had grown more than expected solely on the 
basis of pretest scores and participation in the BE pro- 
gram. This finding is heartening since it demonstrates 
the specific effects of MICA: what was meant to be 
taught (higher forms of thought) was learned. 

As regards general effects. De Avila argues that at least 
MICA did not have deleterious effects relative to BE. A 
slightly more optimistic interpretation suggests itself - 
that student participation in the BE program allowed 
the MICA program to make its effects felt. Since th<»re is 
no suitable comparison group to test this interpretation 
(students who took the MICA program but were not in 
the BE program), the reasoning to it is indirect, .t is un- 
Ukely that such a group (MICA, no BE) could nave ex- 
hibited the range of general and specific growth that 
the MICA group did. The growth of educational achieve- 
ment follows the maxim: the rich get richer, the poor 
get poorer Those scoring below the norm fall progres- 
sively farther and farther behind. Thus, without the inter- 
vention of the BE program, the two groups (MICA and 
companson) should have shown growth which was con- 
siderably less than the growth of the 50th percentile. , 
But not only were their gam scores comparable to that 
percentile, they actually surpassed it. It is very unlikely 
that the MICAs program could, by itself, counter the ex- 
pected trend, i.e.. BE made possible the MICA effects. 
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The difference In interpretation is subtle, but important; 
it supports suggestions that good pedagogy can be 
transferred from the mainstream to the bilingual 
classroom and that the two need to work hand in hand. 

Other findings are worth noting. The teachers' failure to 
predict the "problem" group should provide some 
solace to those who worry that similar predictions are 
self-fulfilling. On the other hand, this result may be due 
to the teachers' unfamiliarity with the curriculum and 
Its demands. Future work might investigate this 
possibility. 

jhe need for social interaction in laboratory-based cur- 
ricula might uriderly two of the more interesting findings 
— the number of children who scored high on both En- 
glish and Spanish LAS increased from pre- to post test; 
and the existeriue of a group of over-age children fOr 
whom the MICA effects were limited. The first result — 
increase in the number of billnguals — Is rather straight- 
forward. Laboratory ^approaches require small groups 
* % whose, partlcrpants constantly interact among them- 
selves as they discuss their findings and help each 
other understand what is going on. This Interaction 
sh&uld lead to Improved oral language skills, which are 
measured by the LAS. 

,X)n the other hand. If the over-age children failed to par- 
tictpate in the small group activities, we would expect 
the results which were found — virtually no growth on 
the English language measures (LAS. and the standar- 
dized tests) nor on the tests measuring understanding 
* of the content (they didn't learn the materials regardless 
of the language in which they were tested), despite 
their high scores on the Spanish LAS and on the CCS 
(they had a language for social forms of communication 
and were not "slow" students). Thus, the important 
question becomes how these children might have been 
isolated Because they were monolingual Spanish 
speakers, they might not have been able to participate 
in the rapid exchanges which characterize small group 
interactions if these exchanges included language 
mixing or if the exchanges were primarily in English. 
Both these patterns of language use would likely occur 
among the bilinguals in the group? Exchanges involving 
the monolingual English children would be predomi- 
nantly in English. That Spanish was not as important a 
language of exchange (therefore, this group could not 
use their Spanish skills) is supported by the fact that the 
increase in the number of bilinguals was due to mono- 
lingual Spanish speakers becoming bilinguals. not due 
to a change in the monolingual English speakers. 

Alternately, other forms of selection (e.g.. by teachers or 
by students themselves) based on the children's age 
and language seem reasonable Future research needs 
to investigate the nature of the interactions, which un- 
derly lab approaches to teaching math and science in 
the bilingual classroom, and the possibility that some 
groups of students are selected out. Finding such a 
group of children suggests that there may be limitations 
to the curriculum and pedag- gic strategy employed by 
De Avfia 

O 
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The methodology employed by this stMdy reflects the 
pragmatic compromises which are comm£m|y made by 
"field based" research. I will leave it to dthefs to sift 
through |he tradeoffs which were made in terms of limit- 
ed resources and the various validities of this study 
(Cook 8i Campbell. 1979). There is one weakness, how- 
ever, which might be remedied by later analyses. The 
choice of comparison groups was done too globally. De 
Avila formed the comparison group from students who 
attended "similar" schools, yet little data was provided 
to support that the students v\teTe, in fact, similar Fur- 
thermore, the data were analyzed by a series of two-way 
(pre/post by group) ANCJVAs. I would recommend a 
more refined strategy, since the analysea were based on 
Individual student scores and not aggregate school 
scores (though schools were selected). Either new 
groups should be formed by matching students along 
pretest scores, or analyses of covarlance (using the pret- 
est scores as covariate) should be employed. Such 
refinements might turn up differences which would oth- 
erwise be hidden. 

An alternate strategy In overcoming methodological 
weaknesses would be to conduct modified replications. 
That is the concern of the next section. 

FUTURE WORK: 

Research efforts replicating the above study might take' 
any (or all) of three directions: a. defining the connec- 
tions between the curriculum and the thinking skills it Is 
meant to foster; b. investigating the role of small group 
processes on the educational outcomes; and. c. study- 
ing predictors of observed effects. 

A. Thinking Skills 

As De AvIla notes in the initial sections of his paper, 
the notion of "higher" forms of thought Is found 
under many names and In a variety of contexts. 
Future work might attempt to better define the. 
specific forms of thought which the MICA program 
fosters. This could be accomplished by a longltudk 
nal study of some children in the program. Repeated 
assessment of the hypothesized forms of thought 
could include: 1) detailed analyses of student proto- 
cols while they attempt to solve problems which 
use those forms of thought and 2) standard mea- 
sures of those forms of thought. Both research 
strategies would require detailed task analyses to 
clearly show the kinds of thought required and how 
these forms are used. 

B. Group Processes 

As noted above, group processes might explain the 
overall increase in the number of bilingual students 
and that a group of students were resistant to the 
program's effects. Research is needed to investigate 
this possibility, as well as mechanisms by which 
groups form dunng laboratory experiences. 




C Predictor&.of Achievement 

. •. \ 

By mcludmg tests of L1 literacy in the testing pro- 
grams, we could obtain measures of LI BICS (Spa- 

. nish LAS). LI CALP (the LI literacy tests). L2 BICS 
(English LAS) and L2 CALP (standardized achieve- 
ment tests in English) (Cummins. 1980), By employ- 
ing a pre/post-test design, we could then Investi- 

.gate the interrelationships among the constructs 
and how they change over time. We could also get 
an empirical test of Cummins' hypothesis that for 
minority children. U- CALP is the most potent 
predictor of future/academic achievement (Cum- 
mins. 1980). / 

Future reseanzm should also consider adopting a 
variety of r^arch methodologies, rather than fol- 
lowing the/tlassic field-based designs. Due to trade- 
offs among methodological needs, policies which 
counter^'^such pure research considerations, and 
limiteq resources for research and evaluation, we 
seld/)m can draw definitive conclusions from large- 
sc^le studies (Cook & Campbell. 1979). The alterna- 
tive is to address the issues from a variety of per- 
spectives and to allocate resources differentially, 
based on the relative importance of thq competing 
questions and on the return promised by the various 
methodologies (Cronbach and Associates. 1980). I 
heartily endorse the alternative approach. 

For Instance, if all three*of the above directions were in- 
corporated Into a single study, the pretest battery could 



be used to classify ; students according to their 
BICS/CALP scores in |L1 and 12. using either factor 
analytic methods or i]hultidimensional classifications. 
Students could be raridomly drawn from each of the 
BICS/CALP groups and then be randomly assigned 
amonti the various studies. Numbers of children as- 
signed to each study/would depend on the Importance 
of that study as well/as methodological considerations 
for each question. Only those children who were being 
tested to evaluate the effects of the project (in the clas- 
sic sense of evaluation) would require a comparison 
group, which could be formed by carefully matching 
children attending other schools along some pre- 
selected variable (6.g.. achievement in English language 
tests at the pretest). This differential allocation of 
resources to address a variety of related questions 
would allow the researcher to study the program in 
greater detail than would otherwise be possible. The 
detailed results should also allow him to draw some 
definitive conclusions. 

Finally, from an educational point of view. I would en- 
courage other BE programs to replicate the pedagogic 
practices espoused by this study. The success of labora- 
tory approaches to the teaching of mathematics and sci- 
ence are many (Bell. Fuson & Lesh. 1976) It is unfortu- 
nate that greater detail concerning the MICA curriculum 
and its teaching methods are not provided by De Avila. 
However. BE programs interested in attempting a similar 
project should contact their local Bilingual Education 
Service Center for assistance, if required. 
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DILEMMAS IN DIAGNOSIS 
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One of the common uses of psychometric devices in the 
field of reading — as in education generally — is for 
educatior:ial diagnosis. Diagnosis is most often a matter 
that relates to a specific individual though we may be 
from time to time interested in making diagnostic judg- 
ments about groups. Diagnostic judgments are often 
based on the comparison of two measure? in order to 
judge whether the individual shows some genuine dis- 
crepancy in the traits or characteristics that the two 
measures represent. Thus, if a child falls at the 50th per- 
centile on a test of word knowledge, but only at the 
25th percentile on a test of comprehension of connected 
prose, the diagnostician must decide how much confi- 
dence to place in the conclusion that this child's ability 
to read connected prose falls short of his knowledge of 
word meanings. The whole armamentarium of diagnos- 
tic devices in the field of reading has its value- in sug- 
gesting judgments of the type "Ability A is greater than 
Ability B" 

But differential judgments about individuals are slippery 
customers. They are peculiarly subject to measurement 
error. Some 45 years ago. Kelley warned of the need for 
especially reliable tests if such diagnostic judgments 
were of be made with confidence. Nothing that has de- 
veloped since then has given occasion for the psycho- 
metrician to change his views on this point. The 
diagnostician, however, cannot wait for the psychome- 
trician to produce the perfect psychometric instrument 
in order to deal with the practical problems of his day to 
day functioning. He must get on with the job. And 
practical limitations of time and resources for carrying 
out his assessments mean that he will always have to 
use tools that fall short of psychometric ideals. 

This being so. what help can psychometrics offer the di- 
agnostician to "carry on" while he waits for the perfect 
diagnostic battery? Perhaps some guidance on the level 
of confidence that he should place in diagnostic judg- 
ments might be useful to tide him over 

We must always remember that any test, or any other 
type of behavior observation, represents only a limited 
sample from some domain of behavior. It represents the 
domain only imperfectly, and the score that it produces 
IS only an approximation to the score that the individual 
would get for the whole domain — or more realistically, 
that he would get on other samples drawn from that 
domain We get evidence on this variability from 
sample to sample of behavior through the various proce- 
dures for obtaining a reliability coefficient, and we ex- 
press It most usefully for our present purposes as a stan- 
dard error of measurement The standard error of mea- 
surement may be thought of as the standard deviation 



of a series of equivalent measures of the same individu- 
al, displaying the extent to which the measures scatter 
awayjrom his "true score." 

Suppose, now. we have two measures. X and Y. Forcon- 
creteness let us say that X is a measure of word knowl- 
edge and Y, a measure of paragraph comprehension. 
Suppose that results from the two measures are ex- 
pressed in a common equal-unit score scale, such as T- 
scores or stanines fdr a common sample of sixth grade 
pupils. Suppose that Peter differs on the two tests by an 
amount D, and for concreteness let us say that this dif- 
ference is 10 points on the T-score or 2 points on the ^ 
stanine scale, i.e.. a difference of exactly one standard 
deviation. How much confidence should '9^e have that 
this difference represents something real, and didn't 
just happen because of errors of measurement in the 
tvyo tests? How confidently can we expect a difference 
in the same direction, though obviously not of identical- 
ly the same amount, if Peter is retested with equivalent 
forms of each of the tno tests? 

In setting our level of confidence, we need to take ac- 
count of three things, two of which have already been 
mentioned. In the first place, we need to take account of 
the size of the standard errors of measurement for 'the 
two variables. The larger the errors — that is. the lower 
the reliability — the lower the confidence. The appropri- 
ate degree of confidence depends secondly upon the 
size of the observed difference between the two scores. 
The larger the difference, the greater the level of confi- 
dence. It depends finally, and quite critically, on the cor- 
relation between the measures. X and Y. of the two attri- 
butes that we are studying. The higher that correlation, 
the other two factors remaining the same, the less confi- 
dence one can have in the meaningfulness of the 
difference. 

Let us look at the rationale for these relationships with 
specific figures for a definite example. Suppose that the 
word knowledge test (X) and the paragraph reading test 
(Y) are each known to have reliability coefficients of .90 
for a sixth grade sample and that for the same sample 
that correlation between the two tests is 0.80. Consider 
Peter, who scored one standard deviation lower (relative 
to the standardization group) on the paragraph test than 
on the word test. 

For a single test with reliability of 0.90. the standard 
error of measurement expressed in standard deviation 
units, is: 

vn"~r"« >j1 0 90 (1) 



^'"prmted with permission from Walter H MacGinitie. ed.. Assessment Problems in Reading. Newark. Del . I.R.A.. 1 973. 



For the difference between two tests, both expressed in 
standard deviation units, the standard deviation of dif- 
ferences arising purely from measurement errors, which 
we might call the standard error of measurement of dif- 
ference is: 



r — r 



^ 5| 2 .90 - .90 - 



0.45, 



(2) 



Thus, a difference between scores of one standard devi- 
ation is equal to 



1.00 
0.45 



2.22 



standard errors of measurement of the differences. Turn- 
ing to tables of the normal curve, we find that a dif- 
ference this large or larger could be expected to occur 
in 1 3 cases out of 1 .000. 

A parallel formula gives the standard deviation of dif- 
ferences between two tests when one knows what the 
correlation between the two tests is. When, as before, 
each test's scores are expressed in standard deviation 
units, the formula for standard deviation of differences 
is: 



>l2-2r - ^2 - 2(0.80) - |4o"- 0.63 



(3) 



Thus, a difference between scores of one standard devi- 
ation is equal to 



Section I: Average Reliability = 0.98 



Diff •r«nce in 
\SD UniU 

?0.25 

; 0.50 

; 0.75 
/ 1.00 
1.25 
1.50 
1.75 
2.00 



Correlation between Variables 



.95 .90 .85 .80 .75 .70 .60 .50 .40 .00 

1 1 2:1 2:1 5:2 5:2 5.2 3:1 3:1 3:1 3:1 
9:1 20:1 



All others greater than 20 to 1 



Section H: Average Reliability = 0.95 



Correlation between Variables 

Difference in ^ 
SO Units .90 .85 .80 .75 .70 .65 .60 .50 .40 .00 



0.25 
0.50 
0.75 
1,00 
1.25 
1.50 
1 75 
200 



1 3 1 2 3:5 2:3 3:4 3:4 4:5 5:6 

54 2-1 5:2 3:1 7:2 7:2 4:1 4:1 

7:2 8:1 11:1 14:1 16:1 18:1 20:1 
13:1 



Alt thers greater than 20 to 1 



7:8 

9:2 



1:1 
5:1 



1.00 
0.63 



1.59 



standard deviations of the differences between these 
two quite highly correlared variables. Turnipg once 
again to our table of the i\iormal curve, we" find that 
given a correlation of this s\ze. differences this large will 
occur in 56 of 1.000 cases. Of these 56. on the basife of 
our earlier calculation, we should expect that 13 were - 
the result of nothing more than measurement error./This 
leaves 43 that represent presumably "real" differences. 
Thus, we may say that the odds are 43 to 1 3 or about 3 
to 1 that the difference is a genuine one. The bfetting 
odds of 3 to 1 represent one way of expressing th^onfi- 
dence that we should feel in the diagnostic judgment 
that Peter is better at word knowledge than at paragraph 
reading. 

Following the same rationale that we have used in our 
illustration, it is- possible to prepare tables showing the 
"betting odds" for representative combinations of relia- 
bility, intercorrelation. and size of difference. An illustra- 
tive set of such tables is presented in Table 1 

Table 1 

Confidence Tables for Diagnostic Judgments: Odds 
That an Observed Difference between Two Variables Is 
a Real Difference 
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Section III: Average Reliability = 0.90 



Correlation between Variables 

Difference In ^ ^_ 

SO Units .85 80 .75 .70 .65 60 .55 .50 .40 .00 



•88- 



025 
0.50 
0.75 
1 00 
1.25 
1,50 
1.75 
2.00 



17 

1:3 
4:5 
3 2 
3:1 



15 

1:2 
3:2 
3:1 
7,1 



14 
3:4 
2 1 
5.1 
12:1 



2 7 
1:1 
5:2 
7:1 
18:1 



1:3 1:3 2 5 2:5 2:5 1:2 

1:1 7 6 5:4 4:3 3:2 7:4 

3:1 31 7:2 4:1 4:1 5:1 

8 1 9 1 10 1 111 13:1 17:1 



7:1 20:1 



All others greater than 20 to 1 



Section IV: Average Reliability = 0.85 



Correlation between Variables 

Difference in ^ m^- m 

SO Units .80 .75 .70 .65 .60 .55 .50 .45 .40 .00 



0 25 
0 50 

0 75 

1 00 
1 25 
1 50 
1 75 
200 



51 



1 18 

1 6 
13 

2 3 
1 1 
53 
9 2 

7:1 



19 
1:3 
2 3 

4 3 

5 2 
4 1 

13 1 



17 
2:5 
11 
2 1 
4 1 
8 1 



1:6 
12 
8 7 
5 2 
5 1 
10 1 



2.9 2.9 
2 3 2 3 
3:2 53 
10 3 113 
7 1 8 1 



1,5 
1:2 
4 3 
3 1 
6 1 

14 1 17 1 20 1 



14 

3:4 
7 4 
4 1 
9 1 



14 

4:5 
137 

4 1 
10:1 



1:3 
1:1 

5:2 
61 
15:1 



All others greater th^^n 20 to 1 



Section V: Average Reliability - 0.80 



Correlation between Variables 



Difference in 


















.00 


SO Units 


75 


70 


65 


.60 


.55 


.50 


45 


.40 


0 25 


1 19 


1 11 


1 9 


1 8 


17 


1 6 


16 


1 5 


• 1 4 


0 50 


1 8 


\1 5 


14 


1 3 


2 5 


2 5 


1 2 


1 2 


2 3 


0 75 


1 4 


1 2 


2 3 


3 4 


4 5 


1 1 


1 1 


98 


3 2 


1 00 


2 5 


4 5 


11 


7 5 


8 5 


9 5 


2 1 


11:5 


3 1 


1 25 


58 


54 


2 1 


5 2 


3 1 


7 2 


4 1 


9 2 


7 1 


1 50 


1 1 


2 1 


3 1 


9 2 


1 1 2 


7 1 


8 1 


9 1 


13 1 


1 75 


3 2 


7 2 


6 1 


8 1 


11 1 


14 1 


17 1 


19 1 


39 1 


2 00 


2.1 


6 1 


11 1 


16 1 













All others greater than 20 to 1 



Section VI: Average Reliability = 0.75 
Correlation between Variables 



Difference in 
















.00 


SD Units 


70 


65 


60 


55 


50 


45 


.40 


Oi25 


1 33 


1 18 


1 13 


111 


1 10 


19 


18 


1 5 


0 50 


1 12 


1 8 


1 5 


1 4 


2 7 


2 7 


13 


1 2 


0 75 


1 6 


2 7 


25 


1 2 


3 5 


2 3 


2 3 


11 


1 00 


1 4 


1 2 


2 3 


6 7 


11 


6 5 


4 3 


2 1 


1 25 


2 5 


3 4 


IT 


7 5 


5 3 


2 1 


9 4 


41 


1 50 


1 2 


1 1 


2 1 


7 3 


3 1 


7 2 


4 1 


7 1 


1 75 


4 5 


8 5 


5 2 


7 2 


9 2 


6 1 


7 1 


14 1 


2 00 


7 6 


5 2 


4 1 


6 1 


8 1 


1 1 1 


13 1 


30:1 



Section VII: Average Reliability = 0.70 



Correlation between Variables 



Difference in 














00 


SD Units 


65 


60 


55 


50 


45 


40 


0 25 


1 47 


1 23 


1 16 


1 14 


1 12 


1 11 


16 


0 50 


1 20 


1 10 


1 7 


1 5 


1 5 


14 


2 5 


0 75 


1 10 


1 6 


1 4 


1 3 


2 5 


1:2 


3 4 


1 00 


1 6 


1 3 


1 2 


3 5 


7 10 


4 5 


7 5 


1 25 


1 4 


1 2 


5 7 


1 1 


1 1 


4 3 


5 2 


1 50 


1 3 


5 7 


1 1 


3 2 


9 5 


2 1 


4:1 


1 75 


1 2 


1 1 


86 


2 1 


3 1 


7 2 


8 1 


2 00 


J ^ 


3 7 


f) 2 


7 2 


9 2 


51 


14 1 



Section VIII; Average Reliability = 0.60 



Correlation between Variables 
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Difference in 










00 


SO Units 


55 


50 


45 


40 




■ 






1 ?0 


1 


f'O 


1 id 






1 h 


1 4 


i > ' 






' b 


^ 4 




1 Of; 


T 






; ^ 


4 ^ 


* :h 






'•' 




13 10 


: M> 


T 




'•: 


4 r» 


?. 1 








4 


f > f) 


3 1 






'i 4 




^> 


f ' 1 



Consider first Section III of Table 1 — the section for an 
average reliability of 0.90 — since 0.90 is a fairly repre- 
sentative reliability for good quality ability tests. Note 
first that no column is shown for an intercorrelation of 
.90 or higher between the two tests. Whenever the inter- 
correlation of two tests is as high as their respective reli- 
abilities, they are effective measures of identically the 
same trait. Differences between the two are then 
equivalent to (and equal in number to) differences aris- 
ing solely from measurement error: there is no basis for 
a diagnostic judgment, and any diagnostic statement 
should be made with exactly zero confidence. 

Note next that when the difference is small, the betting 
odds are low that this is a real difference, no matter 
what the correlation. In the row corresponding to a dif- 
ference of a quarter of a standard deviation, the odds 
that the difference is a "real" one range from 1 real dif- 
ference to 7 chance differences when the correlation is 
0.85. to 1 real difference to 2 chance differences when 
the correlation is zero. Most small differences are readily 
attributed to measurement errors, and our confidence 
that there is any "real" difference must be correspond- 
ingly low. 

Finally, in this table we can see the role that the correla- 
tion between two test scores plays in our confidence in 
the reality of any observed difference. This is seen per- 
haps as clearly as anywhere in the row corresponding 
to one full standard deviation of , difference — a dif- 
ference that would correspond roughly to falling at the 
70th percentile of a group on one measure and the 30th 
on the other. For a difference of this size, our betting 
odds would be 3 to 2 in favor of a "real" difference if the 
correlation between the two test scores were 0.85. 3 to 
1 if the correlation were 0.80. 9 to 1 if the correlation 
were 0.60. and 17 to 1 if the correlation were zero. The 
confidence we should have in a diagnostic judgment 
rises sharply as the correlation between the two mea- 
sures on which the judgment is based decreases. 

To view the effect of test reliability on the confidence 
appropnate for our judgments, it helps to arrange the 
tables in a somewhat different way. Table 2 shows the 
"betting odds" when the size of the difference between 
X and Y is fixed at one standard deviation, but the 
values of the average reliability and the intercorrelation 
are allowed to vary. This table makes it emphatically 
clear how crucially one's confidence depends upon the 
reliability of the measunng instruments. If the average 
of the two reliabilities is 0.98 (one should live to see the 
day when such measures are available!), even the smal- 
lest differences, l e . those of a quarter of a standard de- 
viation, can be accepted with great confidence as real 
and not the result of measurement error With a reliabili- 
ty as low as 0 60. a full standard deviation of difference 
justifies betting odds of less than even money, even 
when the correlation between the two measures is zero. 
For intermediate reliabilities, considerable confidence is 
justified if the correlation between the two nneasures is 
low. relatively little confidence is justified if the intercor- 
relation approaches anywhere near the reliability 
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Table 2 

Odds That an Observed Difference of One Standard 
Deviation between Two Variabtes Is a Real Difference 



Correlation between Variables 
RMliability .95 .90 .85 .80 . 75 . 70 .65 .60 .50 .40 .00 



98 
95 
90 
85 
.80 
75 
70 
60 



All greater than 20 to 1 

1 3 1 Remainder greater than 20 to 1 
3:2 3:1 51 71 8:1 9:1 
2:3 4:3 
2:5 



11:1 13:1 17.1 



2:1 
4:5 
1:4 



5:2 
1:1 
1.2 
1:6 



3:1 
7:5 
2:3 
1:3 



11:3 4:1 
9:5 11:5 
1:1 4:3 
3:5 4:5 
1:5 2:5 



6:1 
3:1 
2:1 
7:5 
4:5 
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What do the tables that we have looked at imply when 
this type of thinking is carried over to some samples of ^ 
actual tests with the reliabilities and intercorrelations 
that characterize them? 

Davis^ has carried out some of the most meticulous re- 
search on the differentiability of different types of read- 
ing skills. Among the abilities that he studied, two that 
were most readily distinguishable were word knowledge 
and drawing inferences. His tests had to be quite short, 
since he was measuring some eight different aspects of 
reading, so the reliabilities of these two tests were only 
.58 and .59. The correlation between them had an aver- 
age value of .45 in several sets of data. Given these 
values, the betting odds are only 1 to 4 that a difference 
of one standard deviation between scores on the two 
tests is "real"; for a difference of two standard devia- 
tions, the betting odds are 9 to 8. As they stand, the 
tests hardly justify diagnostic inferences even when the 
differences are very large. But these tests were short - 
only 12 Items each. If they were lengthened to 48 items, 
which might be a reasonable length for a test in practi- 
cal use. one estimates that the reliabilities would be in- 
creased to 85 and .86. and the intercorrelation to .66. 
Then the betting odds are respectively 5 to 2 for a dif- 
ference of one standard deviation and 80 to 1 for a dif- 
ference of two standard deviations. Thus, we see how 
very critically diagnostic inferences depend upon the 
reliabilities of the constituent measures. 

Two of Davis' tests that measure more similar functions 
are the test of inference and a test that calls for identifi- 
cation of the author's tone, mood, and purpose. Here the 
reliabilities are .59 and .63- and the intercorrelation is 
53 Given those values, the betting odds for the existing 
test are only 2 to 1 1 for a difference of one standard de- 
viation and 3 to 5 for a difference of two standard devia- 
tions Lengthened to 48 items, reliabilities become .84 
and 88 and the intercorrelation 0 75 For this leng- 
thened test, the betting odds are 5 to 3 that an observed 
difference of one standard deviation is "real" and 23 to 1 
for a difference of two standard deviations 

Let us turn our attention now to the Stanford Diagnostic 
Reading Tests, the distinctive value of which is pre- 
O 3umed to he in their diagnostic effectiveness Here, un- 



fortunately, the manual provides only single-testing esti- 
mates of reliability, and these are certainly somewhat 
inflated. We cannot know how much. If we take the fig- 
ures at face value, the average of the subtest reliabilities 
is 0.90 and the average of the subtest intercorrelations 
is 0.65. A more realistic estimate of alternate-form relia- 
bilities might be 0.85. If we assume that figure, and turn 
to Section IV of table 1 for reliability 0.85, we find figures 
in the column for intercorrelations of 0.65 as follows: 

0.25 S.D. (which would occur for 38% of children) 6 

0.50 S.D. (which would occur for 27% of childrer) 1 to 2 

0 75 S.D. (which would occur for 1 9% of children) 8 to 7 
1 .00 S.D. (which would occur for 1 2% of children) , ^ *° f 

1 50 S.D. (which would occur for 4% of children) 1 0 to 1 
2.00 S.D. (which would occur for 1% of children) over 20 to 1 

Thus, if we limit our diagnostic inferences to the one 
percent with the most extreme differences, our judg- 
ments will almost always have a real basis. If we set a 
lower threshold, and undertake diagnostic statements 
for as many as 10 percent of children, there wJI be a 
basis in reality for something like three-fourths of our 
judgments. If we set a still more liberal standard arid 
venture diagnostic statements based on observed dif- 
ferences for as many as 20 percent of the group, the 
statements will correspond to real differences only 
about half the time. 

Finally consider a set of data for the reading test of the 
Stanford Achievement Battery given once in the sixth 
and once in the eighth grade. For one suburban New 
York school system, the correlation between the two 
testings was .747. An estimate of reliability drawn from 
the test manual is .93. How much would a child have to 
change his position in his group from the first to the 
second testing for us to have an even-money bet that 
there was a real change? The answer comes out to be 
0.40 standard deviations. If a child were to improve his 
position in his group by four-tenths of a stan^Jard devia- 
tion (for example, from the 50th to the 65th percentile),^ 
it is a fifty-fifty proposition that this repfesents some 
degree of real change and not just the eff^t of measure- 
ment errors. 

The tables and illustrations that we have examined illus- 
trate the impact of reliability, intercorrelation, and score 
difference upon the confidence that one can logically 
place in an observed difference between two scores. 
They illustrate that over the realistic range of test relia- 
bilities, and using the kinds of pairs of measures that we 
are likely to want to use in diagnostic studies, the confi- 
dence is often distressingly low. But children with read- 
ing disabilities are there, and they won t just go away 
until that happy day when we have diagnostic tools of 
reliability high enough to permit us to make judgments 
of score difference at a high level of confidence. Therein 
lies our dilemma. Wherein do we find our salvation? 

If salvation exists, it lies in the fact that most of the ac- 
tions following from diagnostic judgments are reversi- 
ble, and if they are unfounded, they are likely to result in 
wasted time or effort rather any more crucial loss. In 
this re&pecL instructional decisions differ from selection 
and classification decisions, since these are typically 
permanent The young person who is denied access to a 
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particular educational irtstitution or job is not likely to 
be given a second chance. But if the special instruction 
in word-analysis si<ills that seems to be called for by a 
diagnostic reading profile is not effective* it is always 
possible to hold up. take stock, get new or additional evi- 



dence* and follow up some alternative hypothesis. Our 
tables of betting odds suggest how tentative our hy- 
potheses should often be. Fortunately, they often can be 
tentative. It is important that we keep them so. 
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Confusion, misinterpretations, and conflicting theoreti- 
cal beliefs characterize the language assessment land- 
scape as currently understood and practiced by college 
and university personnel. A somewhat similar, pessimis- 
tic picture describes the efforts of state and local public 
school personnel in classifying and measuring their 
"limited English proficient" populations and bilingual 
teachers, respectively 

Confusion is apparent in the plethora of terms that are 
used to identify some aspects of the langauge assess- 
ment process over a specific period of time. For in- 
stance, there are the terms, "language proficiency." "lan- 
guage dominance" "bilingual proficiency": another set 
includes "language aptitude." "language ability" "lan- 
guage attainment." "linguistic academic achievement." 
and "global language proficiency" With emphasis upon 
the biHngual individual, such classifications as "bal- 
anced bilingual." "equilingual." "comparably limited." 
and finally, "semilingual vs. alingual" have occurred in 
the literature produced by "academic scholars" and "pro- 
fessionals," respectively (Note 1). A careful analysis of 
these terms that have in common the notion of language 
assessment results more often in confusion than clarity. 

In addition, misinterpretations are another outcome of 
the incomplete definitions of these terms and their oper- 
ations. For example. "limited-English proficient" pupils 
whose functional language is English are programrhed 
to receive instruction in th^ir primary language to better 
understand such content-area concepts as "sets." "elec- 
tncity" and "democracy": or there are first grade child- 
ren whose native language is not English and who are 
taught in English only In the academic setting, misinter- 
pretations are evident in admission committees reject- 
ing promising bilingual teachers and administrators 
solely on the basis of standardized test scores that pre- 
sumably yield valid and reliable information about an in- 
dividual's English verbal ability. 

Besides these misinformed plans of action resulting 
from the process of language assessment, opposing 
theoretical frameworks continue to be debated, partic- 
ularly in the area of the meaning of the term, "language 
proficiency" While this debate attests to the vitality and 
dynamism of the fundamental disciplines (Note 1). espe- 
cially, psycholinguistics ^nd sociolinguistics. acadertiic 
scholars in these disciplines actively and learnedly con- 
tinue to refine their knowledge of the forms and func- 
tions of language, however, analysis of the types of defi- 
nitions accorded the notion of language proficiency to 
be discussed m a later section, reveals the ambiguity 
and vagueness in which this important concept is 
expressed 



The purpose of this article, then, is twofold. Based upon 
selectecj aspects of the theory of definition as proposed 
by Leonard (1967). issues resulting from the different 
conceptions of language proficiency will be highlighted. 
Properties of definitions, such as ambiguity and vague- 
ness, an^ basic rules for defining terms will be applied 
to judge Ithe completeness of current definitions of lan- 
guage proficiency The second aim of this article is to 
recommend certain guidelines in formulating definitions 
that can ^e useful to professionals and practitioners in 
their practical endeavors to determine this attribute of 
individual^ 



Selected Features of the Theory of Definition 



According \to Leonard (1967). one of man's great 
devices for communicating and clarifying the meanings 
of words or |Dhrases is the definition. A definition usually 
means the ^ct of stating the signification of a word or 
phrase* and \a statement of the signification of a word 
or phrase. \ 

Insofar as definitions are acts of stating, they are impor- 
tant pieces of productive discourse. The definer may 
have different purposes, usually accompanying the 
statements of^i the meanings of terms, i.e.. cognitive or 
pragmatic. A ^cognitive purpose is when the definer 
wants to affedjt or change someone's belief or knowl- 
edge about th^ meaning of a term: a pragmatic purpose 
Is evident if tfie definer's aim is to influence other 
people to use oertain words or phrases in a manner dif- 
ferent from that which they have employed in the past. 
Leonard (1967) has divided the types of definitions in 
the following original list of exhaustive pairs: 

I. Complete — Incomplete 

II. Nominal'!— Real 

III. Informative — Hortatory 

IV. Linguistic — Conceptual 

V. Extensiorjal — Intensional 

i 

These types of definitions will be clarified as the need 
arises in the next section, but they are presented here to 
indicate the variety of purposes for which individuals 
can use definitions.l(See Note 2 for their meanings.) 

\ • 1 

Definitions are espd^cialy helpful in eliminating ambigu- 
ous meaViings. Since most terms or expressions allow^ 
for more th^iionejilstinct meaning, e.g.. "run; the clock' 
runs, the stocking rdns. etc.." there are circumstances in 
which one cannot m\ which of two or more customary 
meanings of the expression was intended by the user of 
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that expression. This situation can be cleared up by 
defining each customary use of the term. 

Vagueness, on the other hand, presents a difficulty in- 
volving only one customary meaning for a word or 
phrase. The custom in question has never fixed the 
exact limits of what is included within the meaning of 
the term. In some circumstances, it is used in a more 
inclusive sense, in others, in a less inclusive sense, e.g.. 
"greater Chicago." "greater New York," Through the 
device of definitions, meanings of terms are clarified, 
thereby reducing vagueness. Figure 1 contrasts the two 
distinct properties of ambiguity and vagueness. 
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Figure 1 . (From Leonard, 1 967, p. 27) 

Two technical terms employed in the theory of defini- 
tion are definiendum and definiens. To illustrate these 
terms, consider the following expressions: 

1 . Set = a carefully defined collection of elements. 

Here the term to be defined is given first (definiendum). 
then an equal sign, and that. In turn, followed by the ex- 
pression which states the meaning of "set" (definiens). 
However, the normal conventions of English would 
permit any of these expressions: 

2. By a set is meant a carefully defined collection. . . 

3. A set is a carefully defined collection. . . 

4. A carefully defined collection of elements Is called 
a set 

5. Let us understand by the term "set" a carefully 
defined collection. . . 

What these five definitions have in common, despite 
their different linguistic contexts, is the expression — 
definiendum ("set") — to be defined and the expression 
— definiens ("a carefully defined collection of ele- 
ments") — selected by the producer as the means of 
communication. Example one will be the standard form 
m whch definitions will be presented In the rest of this 
article 

Regardless of the purposes, types and forms of defini- 
tions, certain basic rules for defining terms are useful 
because they propose certain criteria in terms of which 
one may criticize definitions that one reads or hears. 
Leonard (1967) divides these rules into three groups: 
rules stating //ferary requirements, those stating factual 



\ 

requirements, and those stating formal requirements. 
These rules, with brief explanations, follow. 



Rule I A definition should be as clear as possible. 

Applicable to all discourse, this Rule requires the 
producer to accompany the key statement of the defini- 
tion, or Its core, with additional remarksJhatjAy^ 
the kind of definition being^propx5serirT:e7irear^ 
linguistic, conceptual, etc, (See Note 2.) 

Unless this is done, or the context indicates the concern 
of the definer. the definition will not be as clear as possi- 
ble. In addition, the producer must express the definiens 
in language that the receiver can bis expected to under- 
stand. Finally. Rule 1 recommends the giving of com- 
plete, intensional definitions, unless the circumstances 
dictate another procedure. (^ complete, intensional defi- 
nition is one intended to explain completely what char- 
acteristics are in the total strict intension of the definien- 
dum. e,g.. "a circle " Qf a closed plane curve all points 
of which are equidistant from a given point called the 



center"; contrast this with "a circle 
perimeter area of a given size.") 
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Rule 2. A definition should avoid figurative and meta- 
phorical language. 

Since definitions are considered as ingredients of 
technical discourse, as such, they should not contain 
anything tending toward vagueness. Imprecision, or ob- 
scuf ity, Metaphorical or figurative expressions, e.g., "the 
devil" as ' the prince of darkness." are vague and impre- 
cise and do not have a place in any technical discourse. 
Therefore, the definiens must not employ vague or fig- 
urative language. 

Rules 1 and 2 state the literary requirements of 
definitions. 

Rule 3: The definiendum and its definiens should be 
coextensive. 

This Rule requires that the extensions of the definien- 
dum and definiens. respectively, be Identical; nothing 
must be In the extension of either term which is not in 
the extension of the other As illustrations, these viola- 
tions of Rule 3 are presented: 

a. Example: A Mazda ^ an automobile 

The definiens is too broad: its extension includes ob- 
jects that are not in the extension of the definiendum.) 

b. Example: A Mazda = Qf a red car used as a 
means of transportation. 

(The definiens Is too narrow: Its extension ex- 
cludes objects that are in the extension of the 
definiendum) 

c. Example: A Mazda = Qf ^ fed car 

(The definiens is both too broad and too 
narrow.) 
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Rule 4 An intensional definition must give the essen- 
tial characteristics of the term being defined 

In general Rule 4 affirms that the total strict intensions 
of the definiendum and the defmiens must be identical 
A tefit of this Rule would find out whether or not: (I) the 
characteristics "given" in the definiens are necessary 
members of the total contingent intension of the defi- 
niendum, and (2) that set of characteristics is necessarf- 
jointly peculiar to the extension of the definiendum. 
An example in which Rule 4 is not violated is the follow- 
ing: "Thesaurus *" pf dictionary of synonyms and an- 
tonyms" A violation^.of this rule is evident in this in- 
stance: "Giraffe nf ^ mammal with extremely long 
front legs and an exlfemely long neck;" although some 
essential characteristics are listed in the definiens, the 
qlefinition is Incomplete, omitting back legs and a black- 
blotched fawn, also essential characteristics of a giraffe. 

Rules 3 and 4 capture the factual requirements of 
definitions 

Rule 5. The simple definiendum of a definition must 
not appear in the definiens. 

Examples of violations of Rule 5 are: "a house — a 
house"; and "snake « the offspring of a snake." 'DeTmi- 
tions violating this Rule are said to be circular^' 

Rule fi The definiens of a definition should avoid the 
use of simple synonyms of the simple definiendum. 

Rule 6 IS applicable to conceptual definitions, but not 
linguistic ones, as in dictionary definitions. A violation 
of this rule involves the fallacy of word substitution, e.g.. 
"a wagon Qf a cart"; contrast this definition with, "a 
wagon a wheeled vehicle designed to be drawn by 
an mdepenaent source of power" 

Rules 5 and 6 address some of the formal requirements 
of definitions. 

These basic rules for definition, in addition to the pro- 
cess of definition, can be employed in understanding, 
clanfying, and constructing a definition of the frequently 
used term, language proficiency 

Current Definitions of Language Proficiency 

This section of the paper will summarize current defini- 
tions of the term, language proficiency, and apply the 
concepts and ru^es for definitions delineated previously. 

• In an unpublished manuscript, Farhady (nd.) argues 
that although current theories of language proficiency 
testing have generated numerous hypotheses, many 
have to be questioned because the term iS' inadequately 
defined "Langage proficiency is one of the poorly 
defined concepts m the field of language testing" (Far- 
hady. nd. p 3) Illustrations of this conclusion will 
follow Citing representative academic scholars and their 
definitions The standard form for definitions will be 
used 
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Briere (1972. p. 332). acknowledging the complexities . 
of the concept of language proficiency states his 
definition: 

Proficiency — the degree of competence or the 
capability in a given language 
demonstrated by an individual at a 
given point in time independent of 
^ a specific textbook, chapter in the 

book, or pedagogical method. 

' This definition is complicated and includes words that 
are vague and unspecified, e.g., "competence" could 
refer to linguistic, sociocultural, or other types of compe- 
tencies. Vagueness is apparent in the use of other terms 
such as "demonstrated" (how? orally, written modes) 
and "at a given point in time." Ambiguity is evident in 
viewing language proficiency as either a competence or 
capability. This definition, then, appears to violate Rule 
1 (clarity) and Rule 4 (requirements of an intensional 
definition). The concepts of "competence, capability, 
demonstration, given point in time" must be clarified, 
and the essential characteristics of the term must be 
identified. 

Addressing the area of proficiency testing, Clark (1975;^ 
p. 10) adds another feature to the term proficiency: 

Proficiency test - any measurement procedure 

aimed at determining thQ 
examinee's ability to receive 
or transmit in the test lan- 
guage for some pragmatically 
useful purpose within a real- 
life setting. 

/ In tKis definition, language proficiency includes another 
elem'i^nt. the use of language for real-life purposes. This 
definiWn, then, includes all the complexities of previous 
definitions \h addition to another concept, "real-life set- 
ting^." What consititutes a "useful purpose within a real- 
life letting" remains unclear and vague. In another arti- 
cle focusing upon the differences between direct and 
semi-dir^t tests of speaking ability, Clark (1979, p. 37) 
employs th^ term "global proficiency": 

Global proficiency ^ the examinee's ability to carry 

\ ^ out various language-use tasks 

\ ^ appropriately, and effectively in 

*^ \ realistic communication 

\ settings. 

This definition rerhairis incomplete and vague because 
"carry out" and "language-use tasks" are not specified: 
furthermore. Rule 3 is violated since the definiens is too 
narrow: it extlude? academic settings that require indi- 
viduals to be proficient in classroom communication. 

Upshur (1979), formulating a functional proficiency 
theory for language tests, distinguishes two kinds of lan- 
guage proficiency: 

1. Language proficiency: a relation between an 

individual and a situa- 
tion requiring the use 
of language. \ 
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2. Language proficiency: p.. a psychological capacity 

of an individual which 
together with other 
capacities enables him 
to function in a situation 
requiring the use of 
language. 

In Upshur's view, tests developed with this latter defini- 
tion in mind, essentially language tests for research pur*^ 
poses, seek to answer the question. "Does somebody 
have proficiency?" 

Although this conceptual distinction between two 
types of language proficiency merits closer attention be- 
cause of its implications for the validity of such tests, 
greater clarification of both definitions is necessary. For 
example, the use of the term "relation" in the first defini- 
tion remains vague, even in the vivid examples provided 
by Upshur: what kind of relation is meant? The second 
definition violates Rule 3 despite the following claim: 
"We find that the construct of proficiency has become 
virtually coextensive with human psychology" (Upshur. 
1979. p. 83). The definiens of the second definition has 
become too broad according to this statement; more- 
over the definition does not meet the requirements of 
Rule 4 whereby the isssential characteristics of the term 
are identified. 

These definitions of academic scholars, in particular the 
areas of linguistics, psychology, and psychometrics. 
have their origins in the testing of foreign language pro- 
ficiency. The next set of definitions addresses the educa- 
tional development of limited-English proficient indi- 
viduals, and varying theoretical viewpoints toward lan- 
guage proficiency are evident. 

In an article that delineates some guidelines for the as- 
sessment of oral language proficiency. Burt and Dulay 
(1 978. p. 1 78) present the following definition: 

Language proficiency: the degree to which an indi- 
vidual exhibits control over 
the use of the rules of a lan- 
guage for one. some, or all 
of its numerous and diverse 
aspects. 

These aspects include "the phonological^ syntactic, lexi- 
cal and semantic systems, and discourse and. stylistic 
rules for oral and written communication for different 
vaneties of a given language in various domains and 
social circumstances." This definition, in contrast to the 
previous ones that emphasize the individual's ability to 
use language for real-life contejcts. focuses attention 
upon the multidimensional nature of language proficien- 
cy, in particular, the subsystems of a language. What re- 
mains unclear and ambiguous in their definition is the 
phrase, "exhibits control." which later in the same article 
IS distinguished by means of two types of oral language 
elicitation tasks, natural communication vs. linguistic 
manipulation tasks, respectively. The former task, while 
demanding the communication of something to 
someone, yields the speaker's unconscious use of lan- 
Q luage rules, the latter in which the focus of the indi- 
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vidual is on performing a conscious linguistic manipula- 
tion of the language, demonstrates the individual's 
meta-linguistic* awareness. Two different meanings are 
implied by the phrase, "exhibits control": one is con- 
scious corltrol and the other is subconscious. Moreover, 
the nature and types of language rules are never 
clarified. Rule^ 4 is violated in this definition because the 
definiens does not list the necessary characteristics of 
the definiendum: proficiency is never clearly defined, 
whereas language is comprehensively covered. 

DeAvila and Duncan (1980. p. Ill) view the term as 
follows: 

Language proficiency the student's language 

skills in English which 
are learned in both 
school and natural set- 
tings. . . It is not 
necessarily dependent 
upon specific instruc- 
tion or content. . .lan- 
guage achievement is 
more likely to be depen- 
dent upon proficiency 
than vice-versa. 

Examination of the technical manual accompanying 
their test. Language Assessment Scales (LAS) (DeAvila 
and Duncan. 1975). indicates that a student's oral lan- 
guage proficiency is viewed as performance across four 
linguistic subsystems, instead of one. single aspect of 
language. These are: 1) the phonemic. 2) the referential. 
3) the syntactical, and 4) the pragmatic subsystems. 
Their definition represents a different theory of language 
proficiency from that of Burt and Dulay: in addition, it 
confuses achievement and proficiency and never speci- 
fies language skills that are learned in school and those 
acquired in natural settings. Rules 1 and 4 are violated 
in this definition. 

Cummins (1978: 1979) divides language proficiency 
into "cognitive academic linguistic proficiency" (CALP) 
and "basic interpersonal communicative skills" (BICS). 

CALP = the ability to make effective use of the 
cognitive functions of the language, i.e.. 
to use language effectively as an instru- 
ment of thought and represent cognitive 
operation by means of language. , 

An illustration of. CALP would be the individual who is 
in tune with the semantic complexity of a language, 
both denotative and connotative. and is capable of car- 
rying out cognitive operations in the language. CALP is 
in contrast with BICS. or the "surface" linguistic features 
such as pronunciation, vocabulary, fluency, sociocultur- 
al competence, accent. CALP is definitely an important 
concept because of the nature and requirements of 
classroom learning with its emphasis upon literacy 
skills and deriving meaning from printed materials. How- 
ever. CALP as a definition of language proficiency suf- 
fers from vagueness in the use of the phrase "cognitive 
functions": are analogies, synonyms, antonyms meant 
by these functions? Moreover, how does one's knowl- 
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edge of the technical aspects of language, e.g.. morphol- 
ogy and syntax, relate to CALP? Because the essential 
characteristics of CALP are not specified. Rule 4 is vi- 
olated, thereby making this definition incomplete. 

This analysis of current definitions of the term, language 
proficiency, based upon selected features of the theory 
of definition, has demonstrated the different theoretical 
viewpoints attached to its meaning. Issues relating to 
different assumptions of language and proficiency, to 
purposes in testing language proficiency, and to the 
meanings of this term were identified. Dieterich and 
Freeman have appropriately summarized the state of 
the art (1979. p. 2): "English proficiency — what it is to 
know English — is given different operational defini- 
tions in each theoretical, historical, and legislative con- 
text." In addition, this section has shown the need for 
greater clarity and precision in the use of this term on 
the part of those who will define it." What procedures, 
then, can be employed in the construction of a definition 
of the complex term, language proficiency? 

Some Guidelines for Defining Language Proficiency 

The following guidelines, although incomplete, are sug- 
gested as aids in assisting professionals and practition- 
ers in their practical endeavors to assess the language 
proficiency of their students. Before these guidelines 
are presented, the term, assessment, must be clarified 
in relation to measurement and evaluation. They are 
adapted from Ryan and Cruz (1974. pp. 4-5) and defined 
within a view of education that is contextualistic. i.e., 
educators — administrators, teachers, parents — are 
called upon each working day to make decisions about 
the curriculum and its relevance to the student. To 
enable these educators to make the best decision (and 
not necessarily the ideal one), various forms of informa- 
tion are gathered and interpreted.-Measurement evalua- 
tion, and assessment are terms that are applicable to 
the types of decisions made by educators in the context 
of helping students understand their world. 

Measurement: the application of a standard to a set of 
data. 

Example: (1) This pencil is 6 inches 
long. 

• (2) Josfe. a fourth grade 
pupil, is reading En- 
glish at the 2.1 grade 
level (according to 
test manual norms). 
Measurement: information 4- a 
standard 

Evaluation: the consideration of a set of measure- 
ment data in terms of specified priori- 
ties for change. 

Example: (1) This 6-inch pencil is 
not long enough to 
reach the floor 



(2) Since Josb is in the 
first month of grade 4. 
he's labeled as 
"limited-English profi- 
cient," or a "slow 
reader." 

Evaluation: Information 4- a standard 
4- priorities 

Assessment: a process or program of inventorying 
an individual's strengths and wea- 
knesses, skills, and attitudes that are 
useful in relating to the roles and sym- 
bolic conditions required by various 
educational tasks. (Note 3) 

Example: (1) Although .this 6-inch 
pencil is not long 
enough to reach the 
floor, this new pointer 
that folds into itself 
helps me do the job 
nriore effectively and 
easily. 

(2) Although this standar- 
dized test placed Josb 
at the 2nd grade level 
in reading, empirical 
observations of Josfe 
in class indicate he 
can read most mate- 
rials presented to him 
and interact in a posi- 
tive manner with 
other students in the 
class. 

Assessment: information 4- standard 4- priorities 4- 
context 

While the terms, measurement, evaluation, and assess- 
ment are interrelated, assessment is to be preferred as a 
basis for decision making in educational contexts. This 
conclusion is based upon the notion that the context or 
situation does make a difference. 

With these distinctions in mind, the following guidelines 
are presented in constructing a definition of language 
proficiency. Analysis of the definitions in the previous 
section had one common feature: individuals differ 
among and within themselves with regard to language 
proficiency. The use of such words and phrases as 
"degree of competence." "capacity." "control over the 
rules of language." and 'cognitive operations" is an in- 
tlicator of these individual differences: furthermore, 
classifications such as linguistic subsystems or types Oi 
relative language proficiency — "proficient bilinguals" 
"partial bilinguals" ~ point out the following rule: indi- 
viduals differ among themselves in language proficiency 
or ability. 
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One guideline, then, is to spec% or define the property 
with which one is concerned. \h\s definition, in turn, 
will yield a series of operations tl^at will allow the de- 
scription of individuals in terms ot that property. Ac- 
cording to Ghiselll (1964. p. 16). "a ^od definition of a 
variable is precisely formulated." Spfiicificity identifies 
the essential characteristics of the pr^erty and facili- 
tates the development of a series of operations that 
enable one to observe similarities and differences 
among individuals Adherence to the rules for defining 
terms, awareness of the definer's aims, and avoidance 
of vague and arnbigious expressions will help in the 
statement of precisely formulated definitions. 

Another guideline is to distinguish between what Ghi- 
selli (1964) calls "trait names" and "trait definitions" 
Trait names are employed to identify, label, and reasona- 
bly represent the definition of a property. As illustrated 
in the previous section, the label, "language proficiency." 
was used to represent several different definitions, and 
since the name comes from the definition, it would be 
mistaken to claim that one or the other is a better or 
more valid definition of "language proficiency." Care 
must be exercised in comparing the results of different 
definers or the findings with different language profi- 
ciency tests in which the same property is nominally 
involved. 

Another guideline of particular importance to bilingual 
educators is the kinds of individuals with whom one is 
trying to determine the property The kinds of individu- 
als of concern to educators have an important influence 
upon the way in which one may define the property. 
Ghiselli (1964. p. 18) captures this guideline: "variables 
defined in certain ways are not appropriate for certain 
kinds of individuals. A consideration of the nature of the 
individuals may require us to redefine the variable. Fur- 
thermore, the nature of the individuals may dictate the 
type of variable we conceive ours to be." Justification 
for this attention upon the nature of the individual is evi- 
dent in the use of such terms as language minority stu- 
dent, relative language proficiency, language domi- 
nance, and bilingual discourse: in addition, the articles 
by Farhady (n.d.) and Garcia (1980) address this relation- 
ship of learner attributes and language proficiency. 

These few guidelines pertaining to the definition of lan- 
guage proficiency were presented as suggestions to 
professionals and practitioners as they endeavor to 
assess the cognitive and linguistic capacities of different 
individuals. Once a clear and precise definition is for- 
mulated, one is In a position to develop operations that 
permit the observation of individal differences in the 
property (Note 4) This summary aptly describes the 
dynamic process of defining any important viariable. es- 
pecially language proficiency: 

the way m which we define a variable is a func- 
tion of the concepts, the theories, and the 
knowledge we have about the property and the 
individuals for whom it is a variable With one 
particular set of concepts, theories, and knowl- 
edge we might define a variable one way; 
whereas with different concepts, theories, and 
O knowledge we might define it quite differently. 
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As conceptualizations and theoretical formula- 
tions change and become more refined, and as 
our knowledge increases, our definitions of 
variables change so that what we once defined 
as a simple variable, we now see is a complex 
variable, and what we had taken as five dif- 
ferent variables we now see is one variable. 
Variables are not static, unchanging, universal 
truisms. They are modified, given up. and creat- 
ed as our concepts, theories, and knowledge 
grow and develop. (Ghiselli. 1 964. p. 1 9) 

This quote accurately portrays the language profi- 
ciency assessment landscape of today. 

Reference Notes 

1. "Academic scholars" refers to those individuals of 
an academic community who generate a funda- 
mental discipline (e.g.. linguistics, psychology, 
mathematics) that consists of a body of knowledge 
made up of pure and distinctive forms of informa- 
tion pertaining to so. 'i phenomenon; "academic." 
in this use of the term, means not constrained by 
practical consideration, or learned and scholarly, 
but not necessarily practical. "Professionals" refers 
to those Individuals responsible for establishing ap- 
plied or derivative fields of knowledge (e.g.. medi- 
cine, law. engineering) that include those bodies of 
information composed of concepts and terms from 
the fundamental disciplines and cognate "fields" to 
deal with practical problems and phenomena found 
in those aspects of the human condition to which 
these specializations pertain. See J. E. Hill (1 981 ) for 
a more detailed discussion of this important 
distinction. 

2. These ten types of definition provide a jointly ex- 
haustive pair. i.e. every definition is either complete 
or incomplete, nominal or real. etc. These types are 
defined as follows (Leonard. 1 967. pps. 608-61 6) : 



I, Complete definition: an act of definition intend- 
ing completely to give the signification of its 
definiendum. 

Incomplete definition: one intended only partial- 
ly to explain the signification of its definiendum. 



II. Nominal: one intended to explain what the 
definer means when he uses the definiendum. 
Real: one intended to explain the signification 
of a word or phrase as that word is used by au- 
thors other than the definer 



III. Informative: one aimed at informing its receiv- 
ers as to the meaning intended by authors who 
use the defined expression. 
Hortatory: one that recommends to its receivers 
that they adopt m their productive discourse 
the Indicated meaning for the definiendum. 




IV. Linguistic: one intended merely to explain that 
a certain word or phrase has such and such a 
presumably already familiar signification. 
Conceptual: one that at least in part, analyzes 
(instead of merely identifying) the mearrtng Of 
its definiendum. 

V. Extensionai: one intended to explain (complete- 
ly or incompletely) what objects are in the ex- 
tension of the definiendum. 

Intensionai: one aimed at explaining what char- 
acteristics are in the total strict intension of the 
definiendum. 



3. The framework of educational cognitive styles and 
the concept of matching styles with educational 
tasks are pertinent here. For more information, see 
Baecher (1976: 19B1a: 19B1b) and Hill (19B1) in 
the reference list. 

4. For more information on the nature of operations 
and those of classification, ranking and measure- 
ment, see Ghiselli (1964). 
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The National Defense Education Act (NDEA) was 
passed in 1958. The supposed tjtreat of Soviet superiori- 
ty in the sciences exemplifi^ by the first successful 
orbiting of a man-made satellite in 1957 proved the 
catalyst for wide-ranging r^^orms or at least alterations 
in traditional academic p^grams — particularly in the 
secondary schools — in mathematics, the sciences and 
modern foreign languages. 

Among the programs and areas receiving support 
through NDEA were: 

1 . Vocational Education Programs 

2. Research and Experimentation in More Effective 
Utilization of Television. Radio, Motion Pictures 
and Related Media for Educational Purposes 

3. Improvement of Statistical Services of State 
Education Agencies 

4. Guidance. Counselling, and Testing: Identifica- 
tion ahd Encouragement of Able Students 

5. Loans to Students in Institutions of Higher 
Education 

6. National Defense Fellowships 

7. Financial Assistance for Strengthening Science. 
Mathematics and Modern Foreign Language 
Instruction 

By the end of the fourth year of operation, the National 
Defense Language Institute's program had provided 
training for nearly one-fifth of the elementary and secon- 
dary school foreign language teachers in the U.S. Over 
ten thousand teachers had received instruction in new 

1 methods of language instruction and had improved 
their foreign language fluency at 218 language insti- 
tutes in the U.S. and abroad. There were foreign lan- 
guage institutes for Spanish. French, German. Russian. 
Italian. Chinese. Modern Hebrew and Japanese. 

By June. 1962. 65.000 elementary school pupils were 
studying a foreign language as compared with 5.000 in 
1959. In one Eastern state, foreign language enrollments 
m secondary schools were up 188%. 

New methods were developed. The "new key" or "audio- 
linguar method was "in"; "grammar translation" was 

2 "out", and Glastonbury. Connecticut became a mecca 
for foreign language teachers. Millions were earmarked 

f-H for language laboratories and new materials, both audio 
h3 and visual 
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There was. indeed, a revolution in foreign language 
teaching. New goals required new methods. The new 
methods required new resources — hardware and soft- 
ware. The revolution required new leaders — classroom 
teachers who could really understand and speak the lan- 
guages they taught. 

The goals of foreign language instruction before the 
1960's had been limited, in most instances, to develop- 
ing in the student some modest reading skills, mastery 
of grammatical rules and some translation ability. The 
objectives of the Spanish language classroom of 1955 
were not dissimilar to those of the Greek or Latin 
classroom of 1800. Generations of students who could 
barely read a sentence in French, and who could speak 
not at all could, with ease, identify the negative 
imperative, the regular, irregular and perfectly hea'thy 
past participles, as well as other grammatical esoterica. 

Foreign language teachers taught about language. In 
very rare cases did they actually teach language. This is 
not a criticism of previous generations of foreign lan- 
guage teachers. They knew what their goals were. They 
were disciplining the minds of their students. Communi- 
cation in another language was not a goal. If someone 
really wanted to learn to speak French or German, she 
could jolly well go to Paris or Berlin, and not clutter up 
the foreign language class. Besides, who would really 
want to talk to foreigners anyway? 

Testing has usually been a mirror of instruction. The 
goals and objectives of the instructional program 
should be reflected in the tests. Thirty years ago most 
foreign language tests were reading tests. When stu- 
dents wrote, they usually wrote in English. They wrote 
the English translation for a foreign language word or 
phrase. 

A typical test consisted of a number of vocabulary ques- 
tions like this. The student sees a word in Spanish (or 
French or German) and five English words or expres- 
sions. The student selects the best translation of the 
Spanish word or expression. Sometimes they were 
tricky. They reversed the process. The student looked at 
one English word or phrase and five words or phrases in 
the foreign language. 

Then there were curious hybrid sentences. The student 
would see half a sentence in French and the other half 
in English. Then she would see five possible translations 
for the English half. 
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Students were also given passages to read in the foreign 
language. Parts of sentences were underlined and the 
students had to translate them. And always there were 
exercises where the student had to provfde a label. 
Name the tense, the conjugation, part of speech or de- 
clension. These exercises may or may not have indicat- 
ed whether a student could communicate in another 
language. 

The good peopliB who sponsored and carried out the 
NDEA foreign language program realized that new 
ways to test would have to be found if the new foreign 
language goals and objectives were to be properly 
assessed. 

The Modern Language Association of America request- 
ed of the U.S. Office of Education funding for the devel- 
opment of tests in all four language skills: listening, 
speaking, reading and writing in French. German. Italian. 
Russian and Spanish at two levels for secondary 
schools and a similar battery of tests for teachers and 
advanced students. The tests for teachers and advanced 
students included measures of civilization and culture 
and applied linguistics. 

The classroom tests were tried out with thousands of 
secondary school and lower division college students 
throughout the nation. The advanced tests were admin- 
istered to foreign language majors and to participants at 
NDEA Language Institutes in the United State and 
abroad. For the first time, measures of oral skills formed 
an integral part of a foreign language test battery in a 
national testing program. For the first time, large num- 
bers of high school and college foreign language stu- 
dents were expected \o demonstrate their ability to un- 
derstand and to speak the language they were studying. 

The MLA tests were an important contribution to for- 
eign language teaching per se. as reliable and valid mea- 
sures of language performance. But more important, 
perhaps, was the catalytic effect the MLA tests had on 
all foreign language testing and. indirectly, on foreign 
language instruction. 

The MLA testing program proved that it was feasible, if 
not always easy, to assess oral skills — listening and 
speaking — in a reliable manner, in classrooms from 
Hollywood to Harrisburg. 

The College Entrance Examination Board introduced op- 
tional listening comprehension tests in French. German. 
Italian. Russian and Spanish. Shortly thereafter, compo- 
site listening/reading tests were incorporated in the ad- 
missions testing program. 

The Advanced Placement Program of the College Board 
considered the development of "Language" Examina- 
tions in addition to the Foreign Language Literature 
Examinations. The late Paul Pimsleur. working with a 
leading publisher, developed batteries of tests in 
French. German and Spanish that provided measures in 
the four skills. Pimsleur also developed at that time a for- 
eign language aptitude test battery, so. too. did John 
O Jrroll and Stanley Sapon — The Modern Language 
ERJQtttudeTest (MLAT). 



Foreign language textbooks that had been fixtures in 
school and college programs were thrown out and re- 
placed by new texts accompanied by audio publishers* 
tapes, filmstrips and motion pictures. As part of the 
"packages." publishers offered tests. The tests, though 
often of questionable psychometric quality. (//(/ provide 
for testing listening and speaking skiHs in addition to 
reading and writing. 

Listening comprehension tests appeared in the exami- 
nation programs sponsored by the American Associa- 
tion of Teachers of French, of German, and of Spanish 
and of Portuguese. 

The days of promise of the early 60*s were followed by 
precipitous decline in foreign language enrollment of 
the late sixties and early seventies, the elimination of 
foreign language requirements, the virtual disappear- 
ance of language instruction in the elementary schools 
and apparent public disinterest in foreign languages. 

Foreign language testing could not be unaffected by the 
general condition of foreign language education. The 
composite tests in the College Board program and the 
optional listening comprehension tests soon disap- 
peared. The Modern Language Association classroom 
tests were never revised, nor were the Pimsleur Tests. 
From the mid-sixties to the present, there has been 
almost no new development in foreign language testing 
for the masses of secondary school and college stu- 
dents. Some significant work was done in the Advanced 
Placement Program of the College Board, particularly in 
tests of speaking and writing. However, the impact of 
the Advanced Placement Program, though great qualita- 
tively, is less so quantitatively since it reaches fewer 
than 12.000 students annually. It is interesting to note, 
nevertheless, that the increase in volume for the Ad- 
vanced Placement French and Spanish "language" 
examinations is significantly greater than that of the Ad- 
vanced Placement Program in toto. 

It is ironic that contemporaneous with the decline in 
volume and impact of academic foreign language pro- 
grams, there has been a dramatic increase in the 
number and quality of foreign language programs out- 
side the traditional school and college setting. The 
Peace Corps provided instruction in over 196 languages 
to over 27.000 volunteers between 1967 and 19B1 The 
Foreign Service Institute, the Defense Language Insti- 
tute, and other federal government agencies have 
taught common and uncommon languages to tens of 
thousands of their personnel. Proprietary language 
schools are thriving. Berlitz alone teaches 50 languages 
to an annual learner group numbering in excess often 
thousand. 

While foreign language testing was nearly stagnant on 
campus, off-campus developments were taking place 
that could well have a profound effect on foreign lan- 
guage education in the future. 

The Peace Corps, the military, the foreign service and 
the proprietary language schools were all under obliga- 
tion, explicitly or implicitly, to verify the level of ability 
achieved by their clients. New ways to assess foreign 



language performance were explored. Promising tech- 
niques were tried. out. Many were abandoned because 
they were invalid, unreliable or simply too impractical 
for operational use 

In assessment and measurement, there is regular refer- 
ence made to "formative" and "summative" evaluation. 
In foreign language education, "formative" evaluation is 
usually the testing or evaluation of progress through a 
program of studies. "Formative" evaluation in the 
classroom takes the form of chapter quizzes and unit 
tests that are based on the content of the lesson plans, 
the course outline and the basic text. "Summative" eval- 
uation should provide evidence of how well the learner 
has achieved the major goals or objectives of the pro- 
gram. In our field, summative evaluation should answer 
the basic questions: 

How well does the student understand and speak 
the language studied? 

How well does the student read and write the lan- 
guagestudied? 

In many foreign language programs, "summative" evalu- 
ation is simply formative evaluation on a grand scale. 
The final exam is simply a compilation of elements from 
the progress tests given throughout the course. 

The danger inherent in evaluation of this sort is that suc- 
cessful performance on measures bound to. and reflec- 
tive of. the course of study may not necessarily reflect 
real ability to communicate in the language. 

That a student can memorize and recite some lines of di- 
alogue and correctly replace elements in a pattern drill 
does not mean that she can meet basic needs through 
the medium of the foreign language in a real-life context. 

One cause of the disillusion felt by foreign language 
learners and their consequent abandonment of academ- 
ic language programs may well have been the lack of 
correspondence between academic grades and real abil- 
ity to perform in the language. Whereas, in the school or 
college, the foreign language teacher is confined to a 
credit hour or semester system and is in direct competi- 
tion with other disciplines for the student's time, in pro- 
' grams such as those of the Defense Department or the 
Foreign Service, the student's total effort is applied to 
the acquisition of language skills during a concentrated 
period of time — six hours a day, for example, for forty- 
four weeks At the end of training, the learners must 
demonstrate ability to function in the foreign language 
in situations such as those they will encounter on the 
job An "A" or a "C is meaningless to an ambassador, a 
chief of mission or an artillery officer Faced with the 
need to evaluate the actual performance ability of'their 
graduates and the concomitant need to provide labels 
that would be readily understood by non-specialists. 
the linguists at the Language School of the Foreign Ser- 
vice Institute developed a rating scale to describe speak- 
ing ability and an interview-based evaluation procedure 
for assigning ratings Although the scale and testing 
procedure were designed for use with their own 
graduates, the Foreign Service Institute linguists were 
Q ^3\\e6 upon to administer their test and rate the per- 

ERIC 



formance of personnel from other government agencies 
such as AID and the Peace Corps. What is important is 
that the measure and the scale were equally valid ^hen 
used with those who went through the FSI course and 
with those who learned the language at home, in school 
or overseas. The interview and rating constituted a sum- 
mative evaluation procedure independent of any pro- 
gram of studies, a procedure that established as the cri- 
terion the student's performance compared to the 
speaking or oral interaction ability to be expected of an 
educated native speaker. 

Performance-based evaluation of foreign language 
skills, as exemplified by the language proficiency 
interview, has been adopted by agencies within and 
without the federal government. State and municipal 
education agencies have chosen the interview and 
scale for purposes of bilingual teacher certification. The 
Province of New Brunswick (Canada) uses the interview 
with secondary school students for evaluation of the 
provincial second language (French/English) program. 
Missionary groups, international student exchange pro- 
grams and major industrial organizations require the in- 
terview for assignment overseas. 

Unfortunately, evaluation procedures such as the Lan- 
guage Proficiency Interview are still little used in tradi- 
tional academic foreign language programs. 

In bilingual education, the need for evaluation of lan- 
guage performance in a real-life context is of paramount 
importance. 

In assessing the English language skills of children with 
limited proficiency in the language, the need is to deter- 
mine how well each child can function through the 
medium of English in the classroom. 

While many of the test exercises used in the foreign lan- 
guage classroom have application in the bilingual 
classroom, there are indeed some major differences. 

The most obvious difference is in the expected use of 
language skills. In most foreign language classrooms, 
the time lag between instruction and real life application 
of skills may range from months to never. In the bilingual 
context, there is usually immediate and almost simulta- 
neous application of newly acquired skills. Concomitant 
with the enhancement of motivation that comes with 
immediate gratification, there is the need for appropriate 
critena. In the traditional foreign language classroom, 
the criteria against which student performance was 
measured often had little to do with communicative 
ability. In the bilingual arid ESL classroom, the criteria 
are real-life communicative criteria exemplified by what 
IS going on in the English-medium classroom down the 
hall. The ultimate test is whether the youngster can 
function in f/?af classroom. 

The major measurement needs in bilingual education 
remain the same as they have been for over a decade: 

1. A means to determine whether a child should 
be placed in a bilingual program or not, 
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2. Procedures for determining mastery of subject 
matter in the bilingual classroom. (The whole 
issue of the optimum format for subject-matter 
tests in the bilingual classroom has yet to be 
resolved.) 

3. Reliable determiners of when a yourjgster is 
ready to profit from instruction given in the 
English language. 

Over a decade ago. Theodore Andersson and Mildred 
Boyer predicted that within ten years bilingual educa- 
tion would have to prove itself to its critics. 

Irrefutable evidence of the efficacy of bilingual programs 
would have to be presented to the uncommitted. The 
ten years have passed. We have had the AIR report that 
proponents of bilingual education all reject. Yet we 
have little or no evidence to present in support of our 
programs. 

Reliable evaluation and valid tests are an absolute 
necessity for bilingual education. We cannot use a ques- 
tionable test that shows our children to be reading dra- 
matically below grade level as justification for funding 
and then deny the validity of the same test when it indi- 
cates that our "treatment" of the children has had no 
effect or even a deleterious effect. We can't have it both 
ways. 

Assessment or evaluation is necessary to recognize defi- 
ciencies in a program and to discover strengths and 
weaknesses in students or in teachers. The information 
gained from such assessment should be used to im- 
prove instruction and meet the needs of children better 
The need for evaluation and assessment is clear, but the 
^procedures, the means, and the instruments, present 
roblems. To evaluate a program, the goals and objec- 
tive of the program must not only be stated, but must 
be stated m such a way that their attainment is clear 
and rmasurable. Affective goals are particularly difficult 
to defink in a measurable way. If there is no way to 
determineVhether goals have been achieved, the goals 
themselves tend to be no more than words. "Improved 
self-image " an^"better concept of self are laudable ob- 
jectives, but are of little value to a program unless beha- 
vioral manifestatiohs of the improved self-concept and 
jfnage can be perceiyed and evaluated. To evaluate a 
program, a variety of a^essmerjt tasks must be carried 
out — . aptitudes and atH|ities pf children, mastery of 
subject-matter content, language proficiency (in both 
English and the home language and in all four skills, if 
appropriate), and attitudes. Tft^' measurement of apti- 
tudes and mental abilities or 10 dfchildren who are not 
middle-class members of the dominant culture has long 
been under attack. The practice of (piling children into 
classes for the educable mentally /etanipd on the basis 
of an IQ test administered in English. wlWi their home 
language is other than English, has receiveoNyvidespread 
publicity and general opprobnum. \ 

If English-language IQ tests are inappropriate me^ures 
for non-English-speaking children so. too. can be ti^ts 
administered m the child's home language. Such test 
may simply be translations of English-language instru-\ 
nents with cultural irrelevancies intact, or they may be 
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instruments designed for children who. though sharing 
essentially the same language, come from a very dif- 
ferent cultural background. Before any foreign tests are 
considered for use with non-English-speaking children, 
they should first be scrutinized to determine the extent 
pf their l.nguistic and cultural "fit" for the population to 
be tested. 

Children in a bilingual program must be tested routinely 
in various content areas to assess their progress. The 
language medium for testing should be the same as the 
medium for instruction. If children are taught social 
Studies in French, the social studies test should be in 
French as well. Although this is recommended for early 
grades, there are indications that once linguistic compe- 
tence is attained in the second language, transfer of in- 
fbrmation across languages can readily be 
eiccomplished. 

Subject-matter tests are normally in one language and 
limited to two skills, reading and writing. If bilinguals 
have rn unequally developed set of skills in two lan- 
guages, it might be desirable to create measures in skill 
modalities that would conform to the linguistic 
strengths of the examinee. For example, if a student has 
a fully developed oral ability in Navajo, but no reading 
or writing ability in that language, and has moderate 
oral skills in English and moderate reading and writing 
skills in English, the optimum test modalities for him 
might be spoken Navajo and printed English If bilingual 
students require evaluation of their attainment in 
subject-matter areas, they also require continuous eval- 
uation of their linguistic development, both in English 
and in the home language. This evaluation is essential 
for decision making at various times and for various pur- 
poses. An initial assessment of language dominance is 
needed to determine whether there is a need for a pro- 
gram of instruction in the home language. This is done 
in some programs by means of a questionnaire to par- 
ents. Reliance on indirect indicators of language domi- 
nance such as surnames, ethnicity, and parents' lan- 
guage dominance is questionable. 

The most valid measure of language dominance is ob- 
tained through an assessment of performance in the lan- 
guage. Such assessments, despite their high validity, 
are open to questions of reliability because they depend 
on a rater's judgment. 

Evaluation of children's language abilities in English 
and in the home language should be carried out routine- 
ly in all language skills. Of particular importance is the 
determination of when the bilingual child is ready 
linguistically to enter the regular English-medium 
stream in the transitional bilingual program. In the 
bilinggal education legislation of one state appears the 
staternent that the child of limited English-speaking 
ability fcnust be enrolled in a bilingual education program 
for three years "or until such time as he/she achieves a 
level af English language skills which will enable 
him/h^.to perform successfully in classes in which in- 
struction IS given only in English whichever shall first 
occur" LegiiSlation in a number of states requires the es- 
tablishment of a bilingual program only if there are 20 
or more cfyldren of limited English-speaking ability be- 
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longing to a specific home language group. Problems 
with finances and staffing can lead administrators to at- 
tempt to mainstream bilingual children as soon as possi- 
ble. Btecause legislation does not specify the procedure 
used to determine that a child has achieved 'a level of 
English language skills that will enable him/her to per- 
form successfully in classes in which instruction is 
given only in English" great care must be taken to 
ensure that the child is indeed ready. What must be 
measured is the bilingual student's acquisition of the 
subject-matter content prerequisite for success in the 
Engltsh-medium class and ability in English in the four 
skills necessary for functioning in the classroom. The 
student's English language readiness must be assessed 
against a real-life criterion. This criterion might be those 
English language abilities in understanding, speaking, 
reading, and writing that are possessed by monolingual 
English-speaking students of the same age and grade 
who are at least minimally competent in the English- 
medium class What is needed is an analysis of mini- 
mum requirements in English listening comprehension, 
speaking, reading, and writing for success in each of the 
grades from kindergarten on. Instruments for assessing 
reading ability and writing for monolingual English 
speakers at different grade levels exist and may be 
useful with bilingual students. Some measures for 
listening comprehension and speaking ability have 
been developed. Much work still needs to be done to 
determine the validity and reliability of such measures. 

With any standardized tests, great care must be taken 
to determine the^r appropriateness with regard to the 
specific population with which they will be used. Some 
of the questions that should be asked are: Is the content 
sampled in the test representative of the classroom con- 
tent of the program? What are the characteristics of the 
reference population (the children in the norming 
sample)? Does the test require a skill that the children 
have not developed? (For example, must they be able to 
read in the home language to answer questions when 
they might not have been taught to read yet?) Is the 
vanety of the home language used on the test familiar 
to the children? Will the information to be gotten from 
the test be useful? 

Virtually all assessment of language skills in foreign lan- 
guage and bilingual education is being done by those in- 
volved in the training process They use instruments of 
their own design, or they use tests designed by the 
producers of the teaching materials — the textbook 
publishers In a Mexican university the other day. the 
head of the Engineering Department said to me that 
education is one of the few fields in which the manu- 
factunng and quality control are done by the same 
person Js there anything wrong with that? There can 
be There is no problem with that kind of measurement 
if the content of the course or the textbook really 
teaches language skills and the tests measure real lan- 
guage There is a problem, however, if the course of text- 
book content does not lead to development of real lan- 
guage skills The test. then, may accurately measure 
mastery of the course, and not tell us a thing about how 
well the examinee can communicate The student who 
Q does well in the French course gets an "A " The student 
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who does poorly in the course gets a "D." The assump- 
tion is that the "A" student speaks and understands 
French better than the "D" student. Maybe, if the final 
examination is a reading test only, then it is quite possi- 
ble that a student with an "A" might not speak a word of 
Frer\ch, A native Frenchman who was illiterate would 
fall. One of the most damaging practices in language 
teaching has been thb "internal accountability" system. 
If I teach Chinese and I alone evaluate my students, then 
I essentially define what is the Chinese language. 

For years American students studied foreign languages. 
They spent two. perhaps three, years in a foreign lan- 
guage class. They memorized their word lists. They 
learned their verb conjugations. They received their A's 
and B's, They fulfilled a foreign language requirement 
for a college or university. They thought they had 
learned another language, Heywood Broun, the humo- 
rist, once said that he had received an "A" in Beginning 
French. But when he got to Paris, he found that jiobody 
there spoke Beginning French. ^ 

These remarks are not a criticism of foreign language 
teachers. They did not develop accomplished speakers 
of other languages because they simply could not. 
Government agencies that teach foreign languages 
have determined that it takes from 250 to 400 hours of 
intensive instruction to reach a survival level. 250 to 
400 hours to be able to order a simple meal or to ask di- 
rections. It takes almost four years of high school lan- 
guage study to accumulate '400 hours of foreign lan- 
guage instruction. In four years the high school student 
is expected to learn to understand, speak, write and 
read the classics In the language and know the history, 
the culture, the mountains, rivers and lakes of the coun- 
try whose language he is learning, ImtDossible! 

We are now enjoying a reawakening of interest in for- 
eign languages on the part of government, commerce 
and the general public. There is the promise of federal 
funds. In the 1960's competence in foreign languages 
was considered essential were we to compete with the 
Soviet Union in outer i^pace and in the battle to win over 
the hearts and minds of the uncommitted around the 
world. Today we are told that competence in other lan- 
guages is essential if we are to compete with the 
Empire of Japan in the market place. Whatever the moti- 
vation for It. we are all pleased with the attention given 
us and our m-ssion. and we hope for more tangible evi- 
dence of interest such as the NDEA programs of the 
1 960's. 

This time, however we will need to be more cautious 
when we accept the gifts. Because, even though it may 
not be obvious, there will be a string attached. The 
string is an expectation that this time we really will 
teach people to communicate effectively in languages 
other than English. 

In order to understand what our needs are. in order to 
monitor student progress in the acquisition of foreign 
language skills, in order to assess the level of mastery of 
individual learners and m order to determine the kind of 
language performance requisite for different tasks and 
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occupations, we will need reliable, valid and practical 
measurement tools. 

There are three critical needs that must be addressed: 

1. Improvement of the test development and test 
interpretation skills of classroom teachers: 

2. Development of tests of receptive skills — 
listening comprehension and reading for na- 
tional administration: 

3. Development adoption and dissemination of 
common descriptors of language performance. 

Most language teachers have had little or no training in 
test development and test interpretation beyond an in- 
troductory statistics course. The number of foreign lan- 
guage testing courses offered in the United States is in- 
significant. Yet major decisions regarding students and 
programs are made on the basis of tests that are often 
unreliable and inappropriate. 

In the training or retraining of bilingual and foreign lan- 
guage teachers, provision must be made for the devel- 
opment of testing expertise. The focus should be on 
how to design tests to reflect the objectives of the curri- 
culum and how to create the appropriate measures for 
testing each of the language skills. In addition, teachers 
should be taught how to review external tests to deter- 
mine their appropriateness and how to interpret stan- 
dardized test results. Teacher training institutions 
should be urged to create language testing courses and 
should be given the necessary resources to initiate such 
courses. Testing should be included as a major area in 
language teachers' in-service training programs. 

The foreign language test batteries developed in the 
1960's are in desperate need of revision or replacement 
Skills of reception listening comprehension and read- 
ing — are most amenable to large-scale objective test- 
ing procedures. It is recommended that listening and 
reading tests covering a wide range of abilities be devel- 
oped in the most commonly taught languages. If the 
range of abilities tested is broad enough, then the same 
tests can be used m secondary school, college and uni- 
versity from beginning through advanced levels of lan- 
guage training. Separate norms for specific sub-groups 
can be developed to provide for meaningful interpreta- 
tion of scores. Multiple forms or versions of the tests 
could be developed periodically. Older versions could 
be made availabje to individuals for self-testing. Longi- 
tudinal studies Of language development would be faci- 
litated by having the same kinds of data available year 
after year. Significant economies might be effected by 
making unnecessafry a proliferation of language tests for 
different programs. 

Too often we d6n't know what we mean when we make 
statements regarding language ability. We have no way 
of describing our "product" There is a real need for all of 
us invoived/m foreign languages as teachers, as stu- 
dents, as^mmistrators and evaluators. to have availa- 
ble a conimon measure, a common, accepted set of de- 
scriptors of foreign or second language ability, so that 
Q everyone will know what we mean when we put a label 
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On someone's language performance. The "A^s. the "B^s 
and "C"s. the Superiors. Goods. Averages and Poors, the 
95's and 80's and 60's obviously have all lost more than 
something in translation. We. the language specialists, 
whose raison d'etre is communication, have failed abys- 
mally in communicating with one another. 

The idea of common standards or common, universally 
accepted criteria may be abhorrent to some. It may 
smack of regimentation or an abrogation of academic 
freedom. On the other hand, the adoption of criteria that 
are comprehensible to all will allow us to be realistic in 
our aspirations. Here is a sample level description from 
the scale used by the Peace Corps, the Foreign Service 
and others. This is the criterion for Level 1 : "Able to 
satisfy routine travel needs and minimum courtesy 
requirements." 

The Level 1 criterion is further elaborated as follows: 
"Can ask and answer questions on topics very familiar 
to him: within the scope of his very limited language ex- 
perience, can understand simple questions and state- 
ments, allowing for slowed speech, repetition or 
paraphrase: speaking vocabulary inadequate to express 
anything but the most elementary needs: errors in pro- 
r. nciation and grammar are frequent but can be under- 
stood by a native speaker used to dealing with for- 
eigners attempting to speak his language." While ele- 
mentary needs vary considerably from individual to 
individual, any person at Level 1 should be able to order 
a simple meat ask for shelter or lodging, ask and give 
simple directions, make purchases, and tell time. 

If a person cannot ask for shelter or lodging, cannot 
order a simple meal or make a purchase, that person has 
/70? fulfilled the requirement for Level 1. 

The Foreign Service estimated that it would take some 
360 hours of intensive training in Russian for a learner 
of average aptitude to attain Level 1. This estimate was 
made on the basis of long experience with large groups 
of learners all evaluated by the same procedure. It is 
very difficult to argue that they should be able to devel- 
op Level 2 ability speakers in 200. 300 or 500 hours 
when solid evidence to the contrary exists. 

In the academic world, we have the freedom to give 
grades or credits based on criteria that we ourselves 
determine. Our clients, however, have different expecta- 
tions. They have been led to believe that if one receives 
high marks for two or three years of language study that 
somehow she will be able to speak French or German or 
fjortuguese. In fact were we to compare contact hours 
in school or college programs with intensive govern- 
ment programs, it would be evident that it would take 
about three years of high school study to achieve d 
Level 1 in Spanish. French or German and about four 
years in Hebrew or Russian. 

Adoption of a common scale will allow us to plan effi- 
ciently and to set forth clear, attainable goals. Is it not 
obvious that in the time required in an intensive lan- 
guage program with small groups of highly motivated 
students under optimum conditions for learners to 



achieve a survival level of performance, we cannot 
under far less than optimum conditions, have our stu- 
dents achieve even higher levels of communicative abil- 
ity in addition to an acquaintance with and appreciation 
of the literature and culture of the people whose lan- 
guage is being studied? 

School administrators, students, parents and the public 
at large must know that it is nonsense to expect us to 
give students even basic survival skills in another lan- 
guage in two short years. We must be able to show 
people what can be expected, realistically, in two years 
or three or four; in six or eight semesters. In order to do 
that, however, we need to be able to describe language 
performance in terms X\}aX everyone can understand, 

A great deal of work in the development and refinement 
of language performance scales has been carried out by 
vanous federal government agencies. Recently, a group 
of linguists and foreign language teachers from Great 
Britain, the United States. Japan and Germany, under 
sponsorship of the English Speaking Union, the British 
Council and Educational Testing Servjce. have met in 
Princeton and London to review existing scales and to 
suggest further refinements. 

Common descriptors of language performance facilitate 
the determination of minimum standards of language 
performance for specific tasks. 

The State of New Jersey wanted to determine what 
level -of ability in English and in the home languages of 
children in bilingual programs should be expected of 
bilingual teacher candidates. In order to answer'the criti- 
cal question of "How good is good enough?", the follow- 
ing study was carried out. 



Five-minute segments from twenty recorded and rated 
language proficiency interviews were selected. Only 
performances rated between 1 + and 4-1- were included. 
It was assumed that no performance below 1 + could 
possibly be acceptable and level 5 performances would 
all be acceptable. The segments were ordered as 
follows: 2. 4, 4+. 3. 1+. 2+. 3+. 3. 4-h. 3+. 2+. 2. 4. l-h. 
2+. 4. 2. 3+. 1+. 3. Eleven judges for English bilingual 
and eleven for Spanish bilingual received instructions 
emphasizing that their task was to judge whether the 
speaking proficiency of the person being interviewed in 
each segment was "at least minimally sufficient for this 
person to function adequately" in a bilingual classroom. 

The judges were, themselves, bilingual teachers, super- 
visors of bilingual programs and administrators in 
bilingual schools. The judges, of course, were not in- 
formed of the ratings and were instructed to judge each 
sample independently and without discussion. 

The results of the study indicated that there was a criti- 
cal level of ability below which most judges indicated 
that the performance was inadequate. 

Similar -Studies could be carried out for any number of 
job categories ~ travel agents, flight attendants, tour 
guides. 

We appear to be on the threshold of another promising 
era for language education. We must not repeat the 
errors of the past. We must know where we are and 
where we are going. We must have our goals set forth 
clearly and comprehensibly. Our objectives must be 
attainable. 

Effective measurement and evaluation will play a critical 
role in the process. 
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Introduction 



The title of this article Is somewhat ambiguous and re- 
quires further definition. Specifically, what will be treat- 
ed here is the following question: How do we test 
bilingual education teachers in order to be sure that 
they can teach bilingually in subject areas such as math- 
ematics, science, social studies, and so on? In order to il- 
lustrate the topic, I will use examples taken mostly from 
Spanish/English bilingual education. The same issues 
are pertinent, nevertheless, to most, if not all, of the 
other languages which are encompassed within the 
bilingual education movement (both through Title VII 
and tn the parochial school sector) in the United States. 
An additional observation must be made at this time: to » 
the knowledge of this researcher (who has conducted a 
rather extensive, but not necessarily exhaustive, search 
of the literature), the specific topic at hand has not been 
investigated. 

This is not to say that the general field has not been well 
addressed by researchers including linguists, educators, 
psychologists, sociologists, and other social scientists. 
It has The bibliography refers to a number of important 
works which impact generally on the topic at hand, 
such as Oiler and Parkings (1980), Jones and Spolsky 
(1975). and Clark (1978), all of which treat in detail the 
theoretical and practical aspects of the language assess- 
ment of teachers. Additionally. Oiler and Perkins (1978) 
review the questions of how-to assess the assessment 
instruments themselves In terms of their validity and ef- 
fectivenefo; a paper by Gonzalez (1980) treats the use 
of English as a Second Language materials in math- 
ematics education (but only focusing on curhcutum 
materials for the student and not an assessment of the 
teacher's competency); Zamora (1981) treats the issue 
of language instruction for bilinguals at the college 
level; and Keller jsee bibliography) has treated in some 
detail the question of choice of register in the bilingual 
classroom Nevertheless the observation stands that to 
the best knowledge of this wnter the topic of assessing 
language competencies in bilingual teachers of subject 
areas such as mathematics and science, is addressed 
for the first time here. Consequently, as with any paper 
that attempts to chart new territory. I am constrained to 
a rather high level of theory speculation and issue iden- 
tification, some of which it is hoped will provide heuris- 
tic value to the specified studies which are sure to 
follow in the coming years. 



This paper will contain four parts: a discusision of the 
issue of the appropnate language vanety or register for 
the bilingual education content area teacher, which at 
first blush would appear to be an easily resolved matter; 
a description of some issues of pertinence in the selec- 
tion of the types of tests which are generally available 
O ' r language assessment^an introduction to the ideal 
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qualities that an assessment i|istrument for evaluating 
bilingual education content area teachers ought to 
have; and finally, a return to ihe critical problem of a 
lack of a clear or well-defined language variety or regis- 
ter in subject-area bilingual education. 



What Language Variety or Register Should Be Assessed 
in the Bilingual Subject-Area Teajcher? 

A problem which has existed in bilingual education 
generally is which variety of alanguage (in this case, for 
illustrative purposes. Spanish) Should 'be used in the 
classroom? 

The answer has been often madfe in the form of one of 
two extremes. There are those .J who exalt the ethnic 
form of their locality and denigrate what the American 
Association of Teachers of S|:^nish and Portuguese 
(AATSP) has called "world standard Spanish." Converse- 
ly there are those who exalt "wSorld standard Spanish" 
and denigrate the ethnic or folkiform. The first group is 
often found in ethnic studies departments on the college 
campus or In alternative colleges, of which there are 15 
tO'20 mainly in the Southwest, and among the ethnic 
communities themselves, particularly among radical 
spokespersons for the Chicano and Boricua 
communities. 

The second group has been wel) described by Rolf Kjols- 
eth. It includes the majority of Spanish teachers, both 
nonethnic and ethnic. Often the ethnic Spanish teacher 
fears the ethnic variety of the language as the stigma 
from which he or she has only rOcentIv escaped. 

Of course. I am overgeneralizing the dichotomy. For 
example, .the AATSP. the professional group of most im- 
portance in the United States, has made very sensible 
statements with respect tp the potential domains of 
ethnic varieties of Spanish versus world standard Spa- 
nish. This is so because the AATSP has turned the ques- 
tion over to its professional linguist^ for public com- 
ment. However, the general membership of the AATSP 
tilts toward negative attitudes about United States ver- 
naculars of Spanish. 

let us<describe very briefly the course that can be taken 
by those who exalt the vernacular at the expense of the 
standard form. Lozano (1974) claims that "the regional 
varieties of Spanish in Mexico and the (United States) 
Scjthwest which share vitually the same morpho- 
syntactic charactenstics should be considered a bina- 
tional macrodialect." (p. 147). Lozano adduces some 
linguistic reasons for subsuming Southwestern United 
States Spanish and Mexican Spanish, but in the socio- 
political arena, his conclusions are nil. 
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The Chlcano is engaged in combat not only with the 
"Anglo establishnnent; but with the disapproving "Mexi- 
can establishment" of which even such a distinguished 
Mexican linguist as Antonio Alatorre (1955) may be 
taken as a representativf3 voice Alatorre compares the 
Chicanos to the moz'arabeso^ medieval Spam, intimat- 
ing that the former like the latter have served to intro- 
duce many foreignisms mto Spanish. He defines the Chi- 
cano (except that he uses the term pocho which is 
pejorative m Mexico) as a Mexican who permits himself 
to be seduced by the American way of life and for 
whom Mexican ways are contemptible and American 
ways unsurpassable. As for the language, it is the pro- 
duct of a border society "that has created a type of dia- 
lect or Creole in which elements of English and Spanish 
are fused " (pp. 11-15) 

Naturally, the Chicano. when , confronted with these 
sorts of stark expressions of prejudice on the part of 
Mexicans, is compelled to systematically minimize the 
Mexican element in the language and systematically 
single out that which is autochthonous. 

Let us look at the same phenomenon at the micro, 
rather than the macro, level Troike (1968) points out 
that "there are. m fact, several native dialects of Spanish 
spoken in Texas alone — even in a single city such as 
San Antonio or El Paso— and most of these are simply 
local varieties of the much larger regional dialect of 
North Mexican Spanish." Troike goes on to observe the 
classroom implications of these differences, but once 
again, the chances of implementing the fact of different 
subdialects in the Southwest into either a coherent 
corpus plan or a classroom pedagogy are obstructed by 
the overriding ideological exigency that Chicano Spa- 
nish IS the vehicle of Chicano self-identify. Thus. Fish- 
man's observation (1968) that while often linguistically 
salient differences go unnoted m society while others, 
perhaps of a more minor nature, are socially pressed be- 
cause divisiveness is an ideologized position, must be 
recognized as a sociolinguistic fact, a fundamental con- 
sideration in analyzing the development of in-groupatti- 
tudes toward the vernacular Moreover, these attitudes 
have had a clear impact on the educational process. For 
example. Gaarder (1977) attests to the fact that numer- 
ous Chicano Studies programs actively denigrate what 
he calls "world standard Spanish" and insist that for 
their purposes the only languages needed are English 
and "Barrio Spanish" 

Let us turn now to the second group Those who exalt 
the standard and denigrate the vernacular are a partic- 
ularly vexatious lot. I shall be brief here because of the 
widespread familiarity that linguists tiave with the 
types of arguments, pedogogically. or politically, that 
are made in this regard Kjolseth (1972) and also Steiner 
(1969,212-213) have characterized this purist approach 
which posits a single variety of language, or of culture 
for that matter as "correct" as one which involves "de- 
education." that is. the belief that the lower class Chi- 
cano. Boncua. or Cuban-American child has to be de- 
educated before he or she can be re-educated Kjolseth's 
graphic conclusion is strong, but vafid these sorts of 
people liken themselves to priests of education busily 
civilizin*^ the savages In fact, they are-e.ngaged in a type 
O if seK-arrogatinx^ educational colonialism. 
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In addition. I must note that there is In the United States 
a special intense hostility directed toward non-English 
vernaculars which linguistically display the evidence of 
daily English language contact. Specifically. I am refer- 
ring to what in this country is popularly termed as 
"Spanglish"— often used as a buzz word by "vernacular 
denigrators." analogical in its extreme negative connota- 
tions to terms such as barrio Spanish, ca/o. pocho. 
Nuyorican. or Rican. which are often proffered as terms 
descriptive of a new. vibrant language emerging from 
the United States Hispanic ethnic communities by "ver- 
nacular exalters." Those who use the descriptive term 
Spanglish for politico-pedagogical purposes often take 
it to mean a pidgin or a hybrid, the illegitimate fruit of 
English-Spanish contact. 

It is necessary, however, that we recognize the pro- 
foundly psychopolitical motivation behind the manifest 
antagonism to United States vernacular Spanish. Al- 
fonso Reyes, the Mexican thinker, once observed about 
his people. "Pity us who are so removed from God and 
so near the United States." It Is especially painful for 
many Spanish-speaking persons to see the English lan- 
guage affect Spanish because of the obvious analogues 
with imperialistic exploitation. Those feelings of frustra- 
tion are linguistically spurious, but are psychologically 
intense. Yet. while we can understand some of the moti- 
vation as attributable to a feeling of anger and frustra- 
tion directed toward what is seen as another instance of 
United States domination— what we cannot accept is 
the. result; stigmatizing both United States vernacular 
Spanish and its legitimate speakers. 

Between the two extremes, there are those who chart a 
middle course. These tend to be sociolinguists and in- 
clude such persons as Gumperz, Dillard. Kjolseth. G. 
Vaides. and Keller For the sociolinguists. the solution 
tends to be the fostenng of bidialectalism. The world 
standard variety is to be added to the ethnic variety that 
the child already brings to the classroom. 

This tug and pull with respect to differing language 
varieties as the medium of classroom instruction within 
bilingual education has had a profound effect on the 
actual creation of curriculum materials In 1974. the Na- 
tional Institute of Education funded a project evaluating 
approximately 1.000 curriculum titles in Spanish 
bilingual education. I was the linguist for that project, 
the results of which were later published by the Educa- 
tional Products Information Exchange Institute (popular- 
ly known as EPIE). The eight types of Spanish that we 
found to be in actual existence in 1974 in Spanish 
bilingual education programs were as follows. 



Types of Spanish Used in Instructional Programs 
for Bilingual Education in the United States 

1. Programs which use "world standard Spanish." The 
language is free of regionalisms. Some of the lan- 
guage may not be understood by United States Spa- 
nish speakers who use a regional or ethnic designa- 
tion instead of the standard one (e.g.. program uses 
autobus, but not canmbn-SW. nor guagua-HB. 



2. Programs which use language specific to particular 
regions or social groups of the Hispanic world out- 
side of the United States, such as Spain, Bolivia, or 
Chile. For example, these programs may use micro 
(Chile) or at/rocar (Spain), but not autobus, camibn 
or guagua. 

3. Programs which use language characteristic of all 
the regions and ethnic varieties of United States 
Spanish, (e.g.. program uses guagua and camibn, 
*btjt not autobud. 

4. Programs which use language characteristic of the 
eastern United States and the Caribbean (e.g.. pro- 
qxacr\ uses guagua. but not autobUs nor camibn). 

5. Programs which use language characteristic of the 
western United States and Mexico (e.g.. program 
uses camibn, but not guagua nor autobUs). 

6. Programs which use non-standard, non-Spanish (as 
in bad translations). 

7. Programs which use both the regional or ethnic 
varieties of language as well as the "world standard 
Spanish" variety, (e.g.. program uses camibn and 
guagua in addition to autobu^. 

8. Programs which use controlled "world standard 
Spanish." using only language in the standard for 
which there are no alternate regionalism or ethnic 
varieties (e.g.. program eliminates camion, guagua 
and autobUshom instructional materials). 

Clearly there are types of Spanish now in use which are 
totally inappropriate with respect to United States 
bilingual education. These include type 2 and of course, 
type 6. which tends to be a bad translation of an English- 
language program. Yet even once we discard these two 
types, there surely remain too many corpora. In addition, 
the viable corpora that do exist are often found to over- 
lap. They require the appropriate "compartmentaliza- 
tion." to use a term that Fishman has advanced. Type 1. 
which IS a rather common language vanety in which 
bilingual education materials are published, in my mind. 
IS most compellingly used for the type of bilingual in- 
struction we are discussing here: the content areas 
such as mathematics, the natural sciences, health, and 
so on. 

Types 4 and 5. for the relevant regions, recommend 
themselves for employment m transitional bidialectal 
education. When students enter the school system with 
only a knowledge pf their ethnic or regional variety, it is 
logical to build upon their knowledge, at least fpr the 
first year or two. by teaching them what they don't 
know on the basis of what they are competent in. This is 
particularly true with respect to the pedagogy of begin- 
ning language arts, with its extensive use of sound- 
symbol and picture-symbol matching techniques, all of 
which are short-circulated when a child uses the ethnic 
term instead of the standard one. when the latter is ex- 
pected by the pedagogy; for example, when the book 

Q expects the child to say puerco and the childs says 

1 1 /^"::/7a/7c/70 instead 



On the other hand, as a result of my participation in this 
massive evaluation. I hypothesized an eighth type, 
which at that time did not exist. Subsequently, a 
number of programs have been written in type 8. includ- 
ing one of my own. I believe that they successfully deal 
with the miscue? that crop up otherwise and therefore, 
are able to teach decoding, encoding, word-attack and 
word analysis skills in world standard Spanish without 
interference effects from the ethnic or regional variety. 

Finally. I find it hard to rationalize the use of either pro- 
gram types 3 or 7. Under the guise of completeness or 
fairness, they offer mind-boggling numbers of synonyms 
for the same meaning. In this sort of program, to give 
one simple example, northeastern children are bom- 
barded with southwestern Spanish in semantic areas 
such as the desert, agricultural communities, the moun- 
tains, the mines, which are simply irrelevant to them; 
the converse is true as well. To give an example, south- 
western children learn the words pibtana guineo, plb- 
tano dedo. and other plants from the banana family, 
when for their language and culture merely one term 
. suffices. Unfortunately, this trend is being exacerbated. 

Attempting to be all things to all groups, publishers 
have tended to supplant synonymy or dialectal equival- 
ence instead of pedagogical logic. Of course, from the 
publisher's point of view, it is wise td print a national 
edition, one that will sell everywhere. Perhaps the culmi- 
nation of this trend can be found in the Santillana pro- 
gram. Aprendiendo en dos idiomas. a slick reading pro- 
gram which is marred, for me. by the fact that the read- 
ing lessons feature a teacher pointing out the lexical 
varieties for different meanings in different parts of the 
United States and abroad as well. I can't fathom to what 
positive purpose we should teach first and second grad- 
ers eight synonyms for bail point pen and ten for bus 
when what we should really be engaged in is the expan- 
sion of their vocabulary to meanings that are totally un- 
known to them. To the extent that this synonymy pro- 
liferates in the classroom, we sacrifice language 
development 

I have stated earlier that Type 1 bilingual instructional 
materials, those which use world standard Spanish, 
would appear to be those most appropriate for use in 
such bilingual education content areas as mathematics, 
the natural or exact sciences, health, and so on. This 
would appear to be. at first blush, a fairly obvious point. 
On the one hand, the use of world standard Spanish is 
justified on the grounds that the applicability of the con- 
tent areas transcends any regional, sectarian or other 
circumscribed linguistic, cultural or political entity. One 
does not think of Chicano mathematics. ^o/7Caa biology, 
etc.. and therefore, the use of world standard Spanish as 
the medium of instruction for mathematics or biology is 
justified by the universality, the transcendence from eth- 
nicity of these content areas. On the other hand, and 
this is part and parcel of the same observation, there are 
very few. If any. ethnically marked lexical or other 
linguistic items which relate to the content ^ areas. 
Linguists judge the domains of the vernacular varieties 
to be centered in such areas as family, peer and kinship 
relations, church, popular art (dichos. corridos. etc.). Sci- 
ence and mathematics have most emphatically been 



the domains of the standard. ancLjthus there are no 
competing language vaneties in the vernacular for the 
lexicon of these fields. Since^ there is nothing for the 
standard to compete with in terms of vernacular varie- 
ties, the question of which language variety to •use 
would appear to be moot. (It isn't as we shall see later) 
The linguisitic base for instruction in the content areas 
has to be the world standard language variety, for there 
IS nothirig else>- 

So far I have limited myself to the language variety in 
which instruction should proceed, and I have arrived at 
the conclusion thatit^stfould be the standard. However, 
the topic of thfTpaper is the language variety in which 
bilmgual content area teachers should be assessed. 
Here the same arguments that I have just been making 
are even further heightened. Indeed, while I don't believe 
that it is either practical or profitable., one could make 
the argument, in theory, that the standard lexicon of the 
content areas, having no competing lexicon in the ver- 
nacular could be. for the purposes of instruction, in say. 
a transitional bidialectal education program at the early 
grade levels, introduced in common vernacular trap- 
pings. Thus, for first and second graders, early science 
and mathematics vocabulary from standard Spanish 
could be presented in the language (e.g. vernacular) 
with which the students are acquainted. 

This is all a rather speculative point that I am discussing, 
however, because it presupposes a vernacular grammar 
in which standard content lexicon would be embedded. 
For Spanish at least, the fortfe of the vernacular is lexi- 
con. There is very little difference in grammar between 
standard Spanish and the many vernacular varieties. 

Nevertheless, to move forward, whatever attraction that 
a speculative instructional format using vernacular 
grammar and standard content area lexicon has for the 
young child.. even this totally breaks down with respect 
to the issue of assessing the bilingual education teacher 
Clearly the content area teacher would have to know 
the standard variety of the language, both to have been 
trained and to be able to communicate with peers in the 
content (eg mathematics, chemistry, etc.) field. Ergo, 
the obvious conclusion is that the assessment instru- 
ment that would evaluate the linguistic proficiency of 
bilingual content teachers would, of necessity, be cast 
in the standard variety of the language. Although the 
bilingual education teacher must know much more thaq 
merely the standard in order to understand and/or 
teach the language arts and culture of the bilingual 
child, for the mere purpose. of assessing that teacher's 
competency in subject areas such as mathematics or 
science, an assessment instrument that is expressed in 
the standard is required. 

While we have arrived at a rather straightforward con- 
clusion that the assessment of bilingual education con- 
tent teachers should be in the standard, we have, unfor- 
tunately not exhausted the topic We shall return to this 
question of language variety later m this paper For 
while it IS true that Spanish at least does not have sig- 
nificant lexical competition between vernacular and 
standard with respect to the content areas of the 
bilirjqual fKiucation curriculum, the proliferation of 



scientific lexicon (sorpe less charitable souls might call 
it jargon) translajedtrom English into standard Spanish, 
and for whjGtIm Spanish no equivalence existed, has 
cause$Miifficulty for the bilingual educator and will, 
catfse difficulty for those whose responsibility it is to 
assess the linguistic competency of the bilingual 
educator. 



A Review of Issues Dealing with Test Formats for As- 
sessing the Bilingual Educator of pontent Areas 

Having briefly outlined a critical language issue to be 
considered in the construction of a proficiency test, let 
us turn our attention to test formats. One of the first dis- 
tinctions deals with the issue of "discrete point" versus 
"integrative" tests. The discrete point test assumes that 
all of the tasks and topics involved in language learning 
can be disentangled, listed, tested separately, discretely 
as it were. Phonemes, morphemes (free or bound), syn- 
tagms. lexicon, idioms, etc.. are tested separately. The 
advantage of such a format is that individual learning 
problems can be separated out and assessed without 
the confounding of intervening, "fudge" variables. A dis- 
crete point test is diagnostic in nature. It permits us to 
ascertain if the testee has acquired each unit of the 
language. Discrete point testing finds some of its basis 
in native language interference research (Lado. 1961) 
and in contrastive analysis. 

Integrative test formats are the natural opposite of the 
discrete format. Integrative test items combine numer- 
ous units of language into a single assessment, and 
thus, their claim is to greater validity. Performance on 
an integrative item is more closely tied to contextual re- 
straints, as in the cloze test in which, for example, a 
paragraph is systematically mutilated by deleting every 
seventh word. (The examinee has to fill in the blanks 
with the correct word or an acceptable substitute.) Con- 
trast the integrative cloze test with the tin/thin sound 
discrimination task, a discrete point item appearing on 
the Language Assessment Scales (LAS). It is readily ap- 
parent that the integrative test item involves a great 
many units of language, not merely one. This is both an 
advantage and a disadvantage. The advantage lies in 
validity, in the fact that in real conversation successful 
performance involves the use of numerous units of lan- 
guage in a single utterance. On the other hand, the dis- 
crete point test is especially useful to the evaluator in 
that one can identify each unit of language which has»* 
not been learned by the testee. While the integrative 
test may be a better indicator of language learning, the 
source of error or lack of competence can not be easily 
identified. 

Another element surrounding the subject of test formats 
relates to the issue of natural versus unnatural language. 
Natural language involves the use of structured and 
nonstructured communication tasks. Unnatural lan- 
guage involves the use of various types of linguistic 
manipulation tasks. Nonstructured communication in- 
volves a conversation between the student and the 
examiner, as is required on the Basic Inventory of Natu- 
ral language (BINL). Structured communication in- 
volves student responses to specific questions asked by 
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an examiner in a pragmatic Context. An example of this 
format Is the Bilmguaf Syntax Measure (BSM). 

Current bias runs heavily in favor of natural unstructured 
communication since this Is the type of communication 
the speaker participates m daily. There are certain disad- 
vantages to nonstructured natural communication 
tasks, however A great deal of speech must be elicited 
in order for a sufficient range of language structures to 
be obtained. Also, it is not possible to make judgments 
as to the testee's command of any structures not ob- 
tained since the situation discussed may not have 
called for them. Performance on such tasks is often 
dependent on personality since some testees are much 
more willing to talk expansively than others. 

Structured communication tasks seem to solve most.of 
these problems. One can diagnose the testee's ability to 
use the conditional or conditional perfect, for example, 
by posing a question the answer to which will require 
such tenses Thus, on the BHingual Syntax Measure, we 
find questions like "What would have happened if the 
dog hadn't eaten the food?" The most common re- 
sponse is "The king would have eaten it." Through the 
careful development of appropriate pictorial situations. 
It IS actually possible to construct a diagnostic profile of 
the child's internalization of numerous structures and to 
use the profile as a checklist. A disadvantage of struc- 
tured communication is that with this format it is not 
readily possible to ask questions which will elicit all 
structures. For instance, how would one elicit a ques- 
tion, a very common structure in conversation? 

Ariother concern surrounding structured communica- 
tion IS that It produces an Inflated indication of the test 
taker's proficiency. There may be discrepancy between 
the test taker's correct usage of a language form on a 
structured task and his/her usage of that same form in 
nonstructured speech. This relates to the overriding 
issue of validity and explains why nonstructured com- 
munication tasks are generally preferred. 

The question of the generalizability of performance on 
structured communication tasks to performance in non- 
structured communication is in desperate need of re- 
search. Nonstructured communication tasks generally 
require a larger corpus of language and a more complex 
scoring system, proficiency testing might be simplified 
considerably if there were research data indicating a 
strong correlation between the two types of tests. Unfor- 
tunately, such research has not been conducted to this 
date 

Previously we mentioned that some test formats 
employ unnatural language. Such formats involve either 
mimicry or manipulation. Mimicry is used on the LAS. 
the MAT'SEA'CAL the Del Rid Language Screening 

"'■ Test, the Northwestern Syntax Screening Test, and 
numerous others It is often used to see if the test taker 
can pronounce certain" words or sounds. In the form of 
sentence repetition, mimicry is considered to be an in- 
dicator of syntax acquisition (Natalicio. 1 979). Linguistic 
manipulation tasks are often based on a foreign lan- 
guage teaching technique known as pattern practice 

qT' us. a student may be given a sentence in the present 



and told to change it to the past. Since children and 
even many teachers, especially if they are in content 
areas, normally do not know grammatical terms, on 
tests that employ linguistic manipulation tasks, the 
examiner usually initiates the response for the student 
as in the following examples from the Spanish/Ameri- 
can Oral Proficiency TesriPolitzerand Ramirez. 1975) 

Today it's clear. Yesterday- 
He studies in the library. They... . 
Hace frio hoy. Ayer... 
El estudia en la bibtioteca. Ellos... 

It has been widely questioned whether linguistic 
manipulation tasks are valid measures of language profi- 
ciency. (Sanchez. 1976; Burt and Dulay. 1978). Indeed 
Politzer and Rami'rez (1975) found that they do not work 
with many students. It appoars that performance is 
related to a kind of "metalinguistic awareness" which 
can be defined as the conscious knowledge of the forms 
of a language. Some students seem to possess this 
awareness, while others do not, in spite of their profi- 
ciency in the language. Thus, there is good reason to be- 
lieve that linguistic manipulation tasks are less desirable 
measures of proficiency than structured communication 
tasks. 

An hypothesis that is relevant to the debate over natural 
communication versus linguistic manipulation tasks is 
that recently put forth by Cummins (1980). Cummins 
distinguishes between cognitive/academic language 
proficiency (CALP) and basic interpersonal communica- 
tive skills (BICS). CALP is defined as those aspects of 
language proficiency which are closely related to the ac- 
quisition of literacy skills in LI and L2. While the aspects 
are not defined, there is at least an implication that it 
refers to such things as a greater vocabulary and the in- 
ternalization of more complex linguistic structures. 
BICS, which Includes oral fluency and sociolinguistic 
competency, may exist independently of CALP. Accord- 
ing to Cummins. CALP is more likely to be assessed by 
linguistic manipulation tasks than by means of natural 
communication tasks. 

A Review of the Relevant Characteristics for an Assess- 
ment Instrument of Bilingual Education Content 
Teachers 

Now that we have reviewed the question of language 
vanety and register and discussed some pertinent as- 
pects of proficiency tests, we are in a position to make 
at least some preliminary recommendations concerning 
the charactenstics and parameters of an instrument to 
assess bilingual content area teachers. 

1. A preferable mechanism for assessment of>the 
bilingual content area teacher is to have the 
evaluation of proficiency for teaching subjects 
such as science and mathematics to be part of 
a general assessment of language proficiency, 
rather than a separate test. It would be more ap- 
propnate to create a science/mathematics/oth- 
er content area subcomponent of a general 
test, for these reasons, among others: 
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a The language which this test assesses will 
be the world standard variety. However ber 
cause It IS important that all bilingual edu- 
cation teachers, including instructors of 
content areas, know well the vernaculars 
of the students, this ethnically marked 
variety of language must also be tested. 
Convenience and practicality would dictate 
that the assessment of proficiency for con- 
tent areas be part of a battery which would 
test for knowledge of the culture and ver- 
nacular of the student population as well. 

b. Below, the suggestion is made that 
discrete-point testing would be appropriate 
for this type of assessment However, 
discrete-point testing is best used when 
combined with integrative test items. Thus, 
another reason for having the content area 
assessment merged into a more general as- 
sessment (which would use integrative 
test items) is highlighted. 

c. Reasons a. and b. are merely subparts of 
the more general proposition that while 
there ought to be an assessment of compe- 
tence for teaching subject areas bilingually. 
the nature of this assessment is restricted 
and can not encompass all that should be 
assessed in the bilingual educator general- 
ly. Since it would behoove the tester to 
certify the general proficiency of bilingual 
educators, the content area evaluation 
component would naturally be an optional 
assessment unit to be given only to those 
educators for whom it would be relevant 

Since, for diverse reasons discussed earlier, it is 
the formal code, the world standard variety, 
which we are assessing, the assessment proce- 
dure should be appropriately constructed to re- 
strict so-called "natural language." Margaret 
Zamora makes the same kind of observation for 
college-level language instruction programs. A 
relevant portion of her paper is worth quoting 
here 

- A language program for bilingual is essen- 
tially different from Anglo-oriented pro- 
grams in that it is designed to teach stu- 
dents who have a working knowledge of 
syntax, vocabulary, and pronunciation and 
to make maximum use of their limited 
competence. This competence, however, is 
adequate within a very reduced contex- 
t—the home environment. The bilingual's 
command of the target language is usually 
oral, familiar, and colloquial, hence, not 
suitable for formal communication. When 
the student is placed m an environment 
which necessitates the use of a vehicle for 
communication other than familiar conver- 
sation (for example, the written word), 
functional use of the language deteriorates, 
often to the point of incoherence. (Zamora, 
1981) 



3. It is fairly self-evident that the sort of assess- 
ment instrument that we are discussing here 
needs to focus on cognitive/academic language 
proficiency, rather than on basic interpersonal 
communicative skills. 

4 The assessment of proficiency for the bilingual 
instruction of content areas such as science 
and mathematics would appear to be focusing 
mostly on two elements: the lexicon of the con- 
tent area, and the knowledge of the correspon- / 
r — dences or equivalencies in LI and L2 of the lexi-/ 
con of the content area Discrete-point test for^ 
mats would seem the most appropriate for tf>e 
evaluation of such competencies for a number 
of reasbns: 

a. Discrete-point testing was itself based qn 
contrastive analysis which in this instance 
closely approximates a skill which the 
instrument itself will attempt to assess. 

b. The linguistic element to be assessed— lex- 
icons-lends itself to discrete-point test 

• formats. 

c. As I have observed earlier, if this assess- 
ment were to be -a subcomponent of a 
larger instrument then other parts of the 
battery (for evaluating language arts, cul- 
ture, etc.) could use integrative test 
formats. 

The Problem of a Lack of a Clear or Well-Defined Regis- 
ter in Subject Area Bilingual Education ^ 

I have observed, immediately above, that the assess- 
ment of language proficiency in the subject areas will 
redound mostly on the area of lexicon. We have further 
come to the ad hoc conclusion that ,the level of that 
vocabulary which should be evaluated is the world stan- 
dard. We have now arrived at a final, critical problem 
area in the assessment of bilingual content area teach- 
ers, at least with respect to the Spanish language. The 
problem is that much of the vocabulary (or jargon) of 
the content areas is borrowed into Spanish from English. 
This has been occurring in a haphazard fashion so that 
in many cases there is no clear lexical item in Spanish 
which can be said to be the standard. The problem is 
notthat for a specific lexical item in English there are. in 
Spanish, competitors between the standard variety and 
one more vernacular varieties. The problem is that an 
English concept (lexical item) is translated in a variety 
of ways by different speakers (more appropriately, writ- 
ers) of Spanish in accordance with subjective prefer- 
ences, and accordingly, there has been no standardiza- 
tion of lexicon. 

Other lexical items have not yet even begun the process 
of being translated into Spanish. For example, in the 
area of sports, we have basketball which has been 
translated into Spanish (with Royal Academy approval) 
as baloncesto. but which is more commonly used, in 
standard Spanish as b^squetboL In linguistics this is a 
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very common problem: there is quite a variety of Spa- 
nish equivalents or translations for the key words of 
sociolinguistics. such as pidgia standard, or code- 
switching. In social work, words such as foster parents 
have no one standard translation, but a variety of 
equivalences. Thus, once again there .is no easy way to 
determine within standard Spanish itself what the lexi- 
cal item is in the language because of competitors (or 
perhaps, more appropriately, embryonic or pre- 
competitors) within the standard, or because of the 
actual nonexistence of equivalent terms in Spanish for 
scientific.words in English. 

We return to the problem that we have too many cor- 
puses in Spanish and not enough standardization. There 
is a weak status base upon which United States Spanish 
rises. One fact is that authorized corpus planners of 
United States Spanish derive their authority from rather 
unique quarters, if they have any authority at all. A 
second fact, which returns us to the vernacular denigra- 
tion/exaltation part of this paper is that corpus language 
planning is not socially innocent. Technical expertise 
alone never seems to be sufficient— there are always 
habfts, attitudes, values, loyalties, preferences and 
ideologies among the planners themselves. 



In many instances. Spanish lexicon appropriate for 
teaching science and mathematics concepts is and will 
remain for. a long time either in flux or will not even be 
present. Full resolution of this problem awaits a long 
term solution based on the emergence of Spanish- 
speaking scientifico-linguistic academies, standardiza- 
tion mechanisms such as dictionaries, nomenclatures, 
grammars and research studies, and the emergence, 
above all, of a cadre of authoritative speakers who oper- 
ate either formally as in France and Spain or along infor- 
mal lines as in the United States, as a sort of national 
standardization model. 

Conclusion 

I have attempted to review the test issues involved in as- 
sessing the proficiency of teachers of bilingual subject 
areas. I have tried to apply the fruit of that review to a 
series of test construction recommendations. Finally, I 
have shown how the question of what to test, which 
would seem to be a fairly simple and forthright matter, 
in some instances will defy resolution. 



References 



Alatorre, Antonio 'El idioma de los Mexicanos," Revista 
de la Universidad Nacional Autbnoma de Mbxico, 
Vol. 10. (1955), 11-15. 

Burt. Marina and Dulay, Heidi "Some Guidelines for the 
Assessment of Oral Language Proficiency and 
Dominance." TESOL Quarterly. 12:2, (197B). 
177-192. 

Clark, John L.O. (ed.) Direct Testmg of Speaking Prof/- 
ciency. Theory and Application. Princeton. N.J.: 
Educational Testing Service. 1978. 

Cummins. Jim. "The Cross Lingual Dimensions of Lan- 
guage Proficiency: Implications for Bilingual Educa- 
tion and the Optimal Age Issue" TESOL Quarterly. 
14. 2,(1980). 175-1B7. 



Keller, Gary D. "Psychological Stress among Speakers of 
United States Vernacular Spanish," Luis Ortega, ed.. 
Introduction to Bilingual Education. New York: Las 
Americas-Anaya, 1975. 

"Constructing Valid Goals for Bilingual and " 

Bidialectical Education: The Role of the Applied 
Linguist and the Bilingual Educator," in Gary D. 
Keller, Richard V. Teschner. and Silvia Viera, Bil- 
ingualism in the Bicentennial and Beyond New 
York: Bilingual Press, 1976. 

Kjolseth. Rolf "Bilingual Education Programs in the 
United States: For Assimilation or Pluralism?" Ber- 
nard Spolsky, ed. The Language Education of 
Minority Children. Rowley, Mass.: Newbury House, 
1972. 



Fishman. Joshua "Nationality-Nationalism and Nation- 
Nationalism." in Language Problems of Developing 
Nations, ed. by Joshua A. Fishman, Charles A. 
Ferguson and Jyatirindra Das Gupta. N.Y: John 
Wiley and Sons. 196B. 

Gaarder. Bruce A; "Language Maintenance or Language 
Shift: The Prospect for Spanish in the United 
States," Bilmgualism m Early Childhood, ed. by 
Theodore Andersson and W.F. Mackey. Rowley. 
Mass.: Newbury House, 1977 

Gonzcilez, Phillip "English as a Second Language in 
Math Education." Journal of the National Assoc/a- 
tion for Bilingual Education.% A. (19B0). 93-10B. 

Jones, Randall L and Spolsky. Bernard Testing Lan- 
guage Proficiency Arlington. Virginia Center for 
Q Applied Linguistics. 1 975 



Lado. R. Language Testing: The Construction and Use 
of Foreign Language Tests. London: Longman. 
1961. I 

Lozano. Anthony./"Grammatical Notes on Chicane Spa- 
nish," The Bilingual Review. 1 :2, (1 974). 

Natalacio, Diane. "Repetition and Dictation as Language 
Testing Techniques," Modern Language JournaL 
63:4,(1970). 163-176. 

Oiler John W. Jr. and Perkins. Kyle. Language in Educa- 
tion Testing the Tests. Rowley. Mass: Newbury 
House. 197B. ' 

Research in Language Testing. Rowley. 



-115- 



Mass: Newbury House. 19B0. 



0 



Politzer. Robert L and Ramirez. Arnulfo G. The Acquisi- Trolke. Rudolph C. "Social Dialects and Language Learn- 

tion of English and the Maintenance of Spanish in a ing: Implications for TESOL/ TESOL Quarterly. Z 

Bilingual Education Program." TESOL' Quarterly. (1968), 176-180. 

9:2. (1975). 113-124. ^ 

Zamora. Margarita. "Language Instruction for Bilmguals 

Sanchez. Rosaura. "Critique of Oral Language Assess- at the College Level: Strategies for Self-Correction." 

ment Instruments." NABE Journal. 2. (1976). The Bllmgual Review.B'AA\QQ^). 

120-127. 

Steiner. Stan. La Raza: The Mexicar) Americans. N.Y.: 
Harper and Row. 1969. 



/ 



7r; 



ERIC 



-116- 



1^ ON ASSESSING THE ORAL LANGUAGE ABILITY OF LIMITED-ENGLISH 

PROFICIENT STUDENTS: THE LINGUISTIC BASES OF THE 
_\ NONCOMPARABILITY OF DIFFERENT LANGUAGE 

r! PROFICIENCY ASSESSMENT MEASURES 

ro 

CD by 
Ixi Benji Wald 

National Center for Bilingual Research 



O. INTRODUCTION 



This paper will explore the linguistic bases of the 
language proficiency assessment (LPA) tests 
widely used in the United States to classify primary 
school students as "limited-English proficient" (LEP) 
or "fluent." 

The order of discussion is as follows: the historical 
context which led to studies of the comparability of 
various LPA tests, the linguistic bases of various 
LPA tests, and empirical work showing that the 
linguistic abilities tapped by various tests are non- 
comparable because different linguistic abilities 
have different patterns of development and use. 

1 .0 THE ISSUE OF TEST COMPARABILITY IN 
LANGUAGE PROFICIENCY 



all measures of language proficiency and even language 
achievement tests (Oiler & Perkins, 1980), to extremely 
complex models factoring out mode and channel, e.g., 
spoken-written; production and comprehension; and 
domain, e.g., home, school, etc. Perhaps the golden 
mean is Cummins' claim that there are two types of lan- 
guage proficiency — • one related to school achievement 
and another which is not (e.g., Cummins, 1980). 

Confronting a potential chaos in the classification of stu- 
dents into LEPs and FEPs or LESs and FESs according to 
district choice, some statt» governments took an active 
interest in comparing instruments in order to see 1) 
how use of different instruments affected the LEP 
count, 2) whether or not an instrument had predictive 
value for school achievement. 



The concept of the LEP student is founded on the 
common-sense notion reflected in the Supreme Court's 
Law decision of 1974, that a meaningful e6ucai\on. an 
Amehcan civil right, means an education in a language 
that a speaker can understand. Since understanding, or 
comprehension, is a gradient notion of more or less, 
more often than all or none, the LEP student is one who 
fails to show adequate command of a language accord- 
ing to some chosen criteria. 

What the chosen criteria are, is where the notion of oral 
LP comes into play The legally sanctioned need to 
' identify LEP students, i e , to match actual students with 
the concept of LEP, led to a proliferation of LPA instru- 
ments available for district use. Literally hundreds of 
LPA instruments, especially for the initial classification 
of K 2 children, appeared either through commercial 
production or initiatives in individual districts (cf. Locks 
etal, 1978) 

On the face of it, the critena and implied notions of lan- 
guage proficiency are different for various tests. More 
deeply, the question has been raised whether various 
measures concentrating on different aspects of Ian- 
o guage are equivalent. Proposed answers to this question 
S have varied from a claim that all aspects of language 
52 proficiency are equivalent, eg. Oiler's recent claim that 
2 there is a global language proficiency (gip) underlying 



In 1979, the Texas Education Agency supported a study 
reported by Gillmore and Dickenson (1979) to compare 
five LPA instruments in Houston, Texas. 

Among the tests compared were three LPA instruments 
of further interest to us here; BINL, BSM, LAS. G & D 
gave pairs of LPA tests to 464 pupils between K and 12 
in six Houston districts. 

Among their findings; 

1 Comparability was poor to poorly moderate. 
BINL/LAS closest for K-2 (Kendall's TAU -48). 
BSM/LAS closest for 3-6 (Kendall's Tau .52). 

2. All together BINL was the hardest of the tests, 
classifying 73% of students as LESA (LEP), LAS 
was the middle (30%). and the BSM was easiest 

(19%). 

3 Of the three pairs, BSM/LAS agreed the most 
78% (N-40), BINL/BSM, 45% (N-51) and 
BINL/LAS, 49%(N-'34). 

4 Of the three tests, correlations (using Pearson's 
R) between the tests and achievement tests 
were only significant for LAS, but very modest- 
ly, eg, reading 31 , vocabulary .28. 



•The research reported herein was supported by the National Center for Bilingual Research through funds from NIE (No NIE- 
R 79:001 1) The opinions expressed do not necessarily reflect the position or policy of NIE, and no official endorsement should be 
inferred 

O iptr pr«9«nt6d at the Language Assessment Institute, National College of Education Evanston, Illinois, June 18, 1981 

ERJC 1981 117 7 / 



In 1980. the California State Department of tducation 
commissioned a comparability study of the BINL. BSM 
and LAS In that study, all three tests were given to over 
1.100 students in grades 1. 3 and 5 in five schools 
throughout California, none having a majority Hispanic 
student body (40% the highest, in La Puente). 

Their findings were similar to G & D in some ways, dif- 
ferent m others (Ulibarri et al.. 1 980). 

1. Different tests identified different percentages 
of the same population as LES. etc. (LEP) (As in 
G&D. 1979) 

2. BSM was the hardest at each grade level, but 
BINL shifted from easiest to second place at 
grade 3. 

3. BINL/LAS had the highest agreement, from 
45% at grade 3. progressing to 65% at grade 5. 

4. Despite 3. LAS and then BSM (except at grade 
5) alone showed significant association with 
reading level. 

Because of the comparability problem, each test having 
a different set of criteria for language proficiency and 
each producing different sets of LEP students from the 
same population. I have chosen to distinguish language 
proficiencyUom language abilities. 

I define language abilities as the abilities that a speaker 
possesses to use a particular language, abilities which 
in performance reflect knowledge, whether conscious 
or unconscious, of a language. I define language profi- 
ciency 3S a quantitative measure of these abilities, or a 
subset of them. For my purposes, LP presupposes an 
LPA test. The test converts language behavior proceed- 
ing from language abilities into a measure of LP. Using 
this terminology, to talk about a LEP is unclear unless 
the criteria are given. On the other nand. language abili- 
ties C3x\ be discussed independently of such classifica- 
tion schemes. If we can identify what a speaker can or 
cannot do using a certain language, we then have a 
basis for discovering what the speaker has to learn or 
how the speaker has to use what he already knows, in 
order to achieve a meaningful education. 

The following diagram schematizes the relationship be- 
tween language abilities and language proficiency ac- 
cording to the above discussion 
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Diagram 1 . Relation of Language Abilities to Language 
Proficiency 

The scheme begins with the speaker's language abili- 
ties. In the first stage of language proficiency assess- 
ment, the LPA instrument is used by the tester to elicit 
language behavior It is extremely important to recog- 
>'\\zfi that the relationship between language abilities 



(LA) and language behavior (LB) is not direct, but is 
mediated by the LPA elicitation. The LPA elicitation cre- 
ates a situation whose effects on the relationship be- 
tween langage abilities and language behavior cannot 
be dismissed. Sociolinguistic research provides ample 
evidence that both quantity and quality of language is 
mediated by'the social situation in which it occurs (cf. 
Labov. 1972: Mace-Matluck, 1980: Wald. 19B0. 19B1). 
This issue will become evident in the ensuing discussion 
of syntax. However, in this paper, the main focus of 
interest will not be immediately on the limitations of the 
tests, but the bases of the information that they provide 
about the speaker's language behavior. To this end. the 
criteria used for LPA scoring are focused on. 

Continuing with the scheme, the LB which is produced 
through the LPA elicitation is then scored according to 
the criteria of the particular instrument used. This is the 
second stage in LPA. The output of this procedure re- 
sults in the language proficiency (LP) classification. 

Thus, the relation of LP to LA is quite indirect, depend- 
ing first on elicitation and then on scoring. 

2.0 THE LINGUISTIC BASES OF LPA TESTS 

The most widely used LPA instruments, the LAS. BSM 
and BINL, and most others, focus on the core linguistic 
components o\ language. These components are: 



1. 



3. 



4. 



phonology: the pronunciation and perception 
of linguistic sounds. 

morphology: the processes of word formation: 
particulary. for English, the most frequently 
used inflectional suffixes, 
syntax: the processes of sentence formation: 
the organization of words into sentences and 
intermediate units, i.e.. clauses and phrases, 
lexicon: vocabulary: the inventory of sound se- 
quences and meanings paired into morphemes 
and words, especially nouns and verbs. 



Each of the instruments is restricted to one or several of 
these components and devises a scoring system 
through which to quantify the results of elicitations, 
leading to assignment of the speaker to a classificatory 
category above or below a cut-off point for limited and 
proficient speakers. 

What IS important to us here is that each of the instru- 
ments has a different specific focus on just what core 
linguistic features it considers, and/or what value it as- 
signs to any particular feature 

The following table shows the differing emphases of the 
three LPA tests. 



Table 1 
BINL BSM 



phonology 
morphology 
syntax 
lexicon 



(+) 



LAS 

-f 

(+) 
(-f) 



'118' 



The BINL measures lexicon only insofar as ii discounts 
non-English words used in an English-intended re- 
sponse in its use of a words-per-sentence count. The 
LAS measures morphology and syntax impressionlsti- 
cally, requiring the scorer to react in an impressionistic 
way to the frequency of deviations from standard En- 
glish syntax and morphology in the speaker's retelling 
of a taped story. Special attention is drawn in Table 1 to 
the fact that the BINL and BSM take virtually diametri- 
cally opposed approaches to the evaluation of syntax. ^ 

In considering the different content of each of these 
instruments, several questions come to mind. First, are 
the different components commensurate? That is. 
would we expect a score on one component to predict 
(have a direct relationship to) a score on another compo- 
nent^ If so. why^ If not. what are the bases for choosing 
or emphasizing one rather than another? 

From a linguistic point of view, we might expect that 
withm a given community most speakers acquiring En- 
glish as a first language would show similar levels of de- 
velopment in all components, especially within the age 
range of 1 0-^ 1 2. with which we will be dealing. Most fea- 
tures of phonology and especially morphology are rela- 
tively frequently used m speech, and most speakers 
would have adequate exposure to assimilate them One 
would expect syntax and lexicon to be more matters of 
individual experience To be sure, speakers of the same 
community should share the most obvious and usual 
syntactic patterns and typical vocabulary by this age 
i However, the more complex forms of syntax, found 
mostly 40 written English, eg. the preposit'onal relative 
clause — the boy to whom I gave the book (rather than 
- the boy (that/who) \ gave the book to — and special- 
ized vocabulary, e g , distributor (auto), beside 
(school-talk or archaic for r)ext to, by), etc . we might 
expect to be differentially distributed 

For acquirers of English as a second language, or even 
as a first langauge in a community in which the speaker 
IS exposed from the outset to non-English speakers or 
many speakers of English as a second language, the sit- 
uation may be quite different The components of Table 
1 are clearly separable Thus, phonology varies greatly 
across English-first communities as well as among non- 
first speakers, regardless of the other components. Lexi- 
con IS also highly vanakjle The most extreme cases of 
separation of lexicon from syntax are shown by Creole 
languages, which may have an English vocabularly. but 
a syntax and morphology quite distinct from the mam- 
stream varieties of English (cf Hymes, 1971) 

It IS far from clear at what rates different components of 
a language, acquired as non-first, develop, or to what 
extent there is any predictable relationship among dif- 
ferent components in second language development 
There is general agreement, based on observation, that 
by early adolescence phonology begins to be acquired 
more slowly than other components Some observations 



also sHow that social motiv^onal factors may inhibit 
syntactic development amongXadults (cf. Schumann, 
1 978; Herrdelberger. 1 978). \ 

Our interest at this point will focus\n the relation of 
morphological to syntactic developmenVamong 42 late 
preadolescent Hispanics (aged 10-12)^\in the Los 
Angeles County area, as elicited by segmentSi of the En- 
glish versions of BSM-1 and LAS-1. We wll\be con- 
cerned with the value of the data elicited bVthese 
instruments as indicators of English language abilitfes. 

3.0 EMPIRICAL STUDY OF THE LPA TESTS 

As part of a comprehensive study of the effect of topic 
and situation on language behavior, in the final phase. 
Individual students participated in a recorded language 
proficiency assessment interview with a bilingual 
female interviewer in her mid 20's. Each speaker was 
given sections of the BSM and then LAS, first in Spanish 
and then in English. 

The BSM-E (English) segment consisted of page 6 of the 
Child Response Booklet. The speaker looks at three pic- 
tures which form a coherent story (pictures 5-7) and Is 
requested to answer a number of questions. None of the 
speakers had any cognitive trouble interpreting the pic- 
ture story or answering the questions in at least one lan- 
guage (Spanish or English). The responses are controlled 
to the point that, in many cases, precise cross-speaker 
morohological data'were obtained 

The LAS-E segment consisted of listening to a recorded 
story following a traditional story-telling structure, al- 
though the content is original: 

Once upon a time there was a. . one summer 
day . the next day. . "I'm never going to drmk 
pink ink again/' (the end) 

The speaker is requested to retell the story This task 
elicits a unit of discourse from the speaker Although it 
IS not as controlled for precision as the BSM elicitation. 
It provides for a longer speech sample and a greater dis- 
play of a variety of morphological and syntactic points 
of interest. 

In examining the results of the BSM and LAS segments, 
we will not be directly concerned with the scoring ap- 
plied to the samples, since these have already been 
shown to be noncomparable by the research discussed 
above, but rather with the more general questions of 
comparability of morphology and syntax on which 
these and other LPA tests are based. 

3.1 BSM-E 

The serment of the BSM-E (English) includes five ques- 
tions (21-25) of which only the middle three are scored. 
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Although the BSM calls itself a Basic Syntax Measure, it is virtually confined to morphology Morphology may be viewed as word- 
vel. as opposed to sentence level, syntax 
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However for our purposes, the response to question 21 
is also.of interest. In a great many cases, the response to 
question 21 was along the lines of the dog Vi^NTS the 
food/to eat The interesting morphological point is the 
-5 marking subject-verb agreement. For speakers who 
created a response '^ith want, the traditional rule of 
subject-verb agre^ent was recorded as present or 
absent For some speakers, no data point was recorded 
because their responses did not contain an opportunity 
for subject-verb agreement e.g.. the dogs hungry 

The responses to questions 22 and 24 focus on irregular 
past tenses. The usual response to question 22 is the 
dog ATE it (- the food), and to question 24 is it the 
apple) FELL Speakers who used the verbs esf and fall 
either used the irregular pasit form or did not inflect the 
verb for tense. Since, unlike subject-verb agreement 
(referred to hereafter as 3S). irregular past tenses must 
be learned item by item. It will be interesting to note 
that a specific relationship between the two past forms 
obtains in the data. Finally, question 23 elicits the most 
interesting data for variability. The question intends to 
elicit an unreal (contrary-to-tact) condition: what would 
, have happened if the dog hadn t eaten the food? The 
' crucial variable is the structure of the auxiliary: would 
have In this analysis, we will attend especially to 
whether the auxiliary was a variant of would have (usu- 
ally woulda Of would of) OT somexhing else !e.g.. would 
will, was gonna, or nothing at all). 

Figure 1 below shows a clear relationship betvyeer> ac- 
quisition of the various-StruGtufe^and'AOA (age of arriv- 
al). Each point gives the percentage of speakers using 
the form indicated out of the number of speakers using 
the structures, i.e.. eat want. fall. 



Percent 

using 

form 




ate {N-2 1/6/7) 
wants {N-20/7/3) 
fell {N-2 1/5/5) 
would have {N-25/6/7) 



0-5 6-8 9+ 

Figure 1; Percentage Using the Indicated Morphologi- 
cal Form. When Appropriate, by Age of 
Arrival 

The figure shows that the unreal condition is the most 
discriminant structure of the four. Not even all of the 
earliest AOA group used would have in their responses. 
Of the two irregular pasts, fellis more discriminant than 
ate The 3S inflection falls in between the two. 

Diagram 2 below shows that there is actually a stable 
implicational hierarchy underlying the four features 
such that development of would have for the unreal 



conditicJn implies development of the other three fea- 
tures, and so^on until the indeterminacy between wants * 
and ate. 



(41) would have (21) would (12) other (8) 




Oiagraiti 2. (N) » Total number of forms in category 

(N) - Total number of pairs of forms for adja- 
cent categories 

According to the implication in Diagram 2: would have 
implies fell implies wants/ate. 

There is no obvious ordering between acquisition of 3S 
and the first irregular past verbs. For some reason, the 
past tense of fe//is less accessible to speakers than the 
past of eat (frequency?). At this juncture, it is worth 
noting that the most frequently used irregular pasts — 
was. went came got — are used in the LAS story retell- 
ing by a// speakers regardless of the implications of Dia- 
gram 2. This suggest thai acquisition of some past 
forms is relatively early, earlier than the initial develop- 
ment of 38. However, the irregular past forms are still 
spreading to other verbs as 3S develops. 

In any event, it is quite generally the case that acquisi- 
tion of the auxiliary structure would -f have (henceforth 
wda) by a speaker indicates that 3S and the past forms 
of a great many irregular verbs have already been devel- 
oped by the same speaker. It follows from this that at 
least for speakers of this age group, wda is a useful diag- 
nostic of a relatively high level of morphological ability 
in English. 

Before moving on to discussion of the LAS story retell- 
ing, it is worthwile to make one further observation 
about the structure of the unreal condition. This con- 
cerns the verb following wda. In the standard English of 
the classroom, wda. like the perfect, requires a following 
verb to be in participial form. e.g. eaten For the verb 
eat. in our sample, this was never the case. Table 2 
below shows that the norm for wda + eat is wda ate. 

Table 2 



would have -f 



AOA 


ate 


aten 


eat 


0-5 


9 


1 


1 


6-8 


0 


0 


2 


9+ 


0 


0 


0 
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In the communities of the speakers (LA Mexican- 
American), either the past participle of eat is ate (cf 
standard eaten), or speakers of this age group are still 
developing the dduU norm (eaten independently of 
first language background This is an empirical issue 
Until It is resolved, it is impossible to apply the BSM in- 
structions for scoring to this form, since the BSM expli- 
citly allows the use of nonstandard (local community 
based) features without penalty in evaluating language 
proficiency The trouble is that the BSM gives no list of 
such features to help the scorer — nor can it. given the 
present state of our knowledge of the conventionally 
nonstandard English of Hispanic communities. 

3.2 LAS-E 

As noted above, the story retelling presents a wealth of 
linguistic data. Foremost, it elicits a coherent multisen- 
tence unit (discourse unit which can be compared both 
to Its source (the recorded stOry) and to other replicated 
versions For present purposes, we are interested in the 
syntactic structure of the story as produced by each 
speaker Our aim is to compare\\o\N the abili- 
ties, revealed by our analysts of the LAS responses, 
relate to the morphological ab\\\\\es evident in the BSM 
responses 

For analysis of the syntactic structure of the stones, two 
variables were selected which figure in the scoring 
procedures of the BIN L"^ 

1 ) number of clauses/per story. 

2) number of types of clauses/per story. 

In Figure 2 below, the number of clauses in the stories 
are broken down by AOA group 
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Figure 2 



Clauses per Story 
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Although AOA showed a clear relationoiup to mor- 
phological development, it shows no clear paitern for 
story length. Most speakers of all AOA groups tell the 
story in 1 1-15 clauses before they return to silence. 

We might suspect that syntactic complexity would be a 
better measure of linguistic development than clause 
length among speakers who are already accustomed to 
producing multisentence units In everyday situations. In 
the present analysis, our interest in syntactic complexity 
will be restricted to clause types. In the stories, we sin- 
gled out seven different clause types used by at least 
two of the speakers, although the LAS source only used 
five (also represented arriong the seven). 

1 . n •¥ clause. This simple type of clause is introduced 
by n representing any form of the conjunction and), 
e.g.. 

it was a giant n he like uh she like uh pink lemo- 
nade, . (PC 12f. A0A6) 

2 (n) then -H cl. Another simple clause type is Intro- 
duced by either then, or n then, e.g . 

. . he ate uhm pink paint n then the next day he was 
sick. . .(CB 1 1f. A0A6) [not in LAS source] 

3, so (then) + cl. Another common clause type is Intro- 
duced by either so or so then. e.g.. 

. , he liked pink lemonade a lot. So one day he went. 
(MC 11m.A0A4) 

4 after/when + cl. A clause is introduced by when ox 
afteras a subordinator. 

\A/hen he dri-drink the water, he said, that 'S not a 
lemonade. (LG 12f. AOA10) 

5 if + cl. This clause type, introduced by if. only oc- 
curred in the story introducing an embedded ques- 
tion, e.g.. 

. he werst n to go^ee //he find lem- pink iemonade. 
iMtR 1 1 f, A0A5)^ [not ;n LAS source! 

6 relative clause. This typp of clause is irttroduced by 
///fV/f.r whcif\n the data, e g , 

Oni.>- tj^.f r, a time there was a- a sfily oUi fru ri^,ter 
th^i! filled to eat ptnk iemonade (BF^ \2n}. A0A9) 
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. . .one day he saw what he thought W2S pink lemo- 
nade. . .(EP 1 1 f, AOAO) 

7. any other clause. The most common clauses of this 
type are those with no introducing marker e.g.. 

.he said 0 he'll never drink pink ink again (JP 12m. 
AOAO) 

or reported speech introduced by that: 

. . .he ^ajd that he'll never eat pink ink again (AP 
12m, AOAO) 

or clause introduced by the conjunctions buto^ be- 
cat/se (both rare): 

. . .he went over to the ink but he didn't know (KR 
11m, AOAO) 

In Figure 3 below. AOA groups are plotted for clausal 
variety. There is no clear pattern discriminating the 
AOA groups. Generally, speakers tend to produce no 
more than five different clause types, without a clear 
preference between 2-3 and 4-5 types. 



100 -r 
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using number 
of clause 

^^^"^ - AOA94-(N.7) 

AOA 0-5 (N-28) 
AOA 6-8 (N-7) 

2-3 4-5 6-7 

Number of clause typas 

Figure 3. Distribution of Variety of Clause Types by Age 
of Arrival 

A companson of Figure 1 with Figures 2 and 3 shows 
that there is no clear relationship between morphologi- 
cal behavior and either length or clausal variety in dis- 
course units While AOA shows the development of 
morphological forms, it is indifferent to the types of 
syntactic tehavior we have investigated so far. 

It IS important to note m this context that our ap- 
proaches to the analysis of morphology and syntax are 
necessarily different We can easily compare a mor- 
phological form with the standard form, but we cannot 
make such a comparison for" syntactic variety since 
there is no clear standard for story length or syntactic 
variety It is far from obviOus that a speaker who uses 
only a few syntactic devices in the story retelling does 
so because he does not have adequate command of 
other syntactic devices We certainly cannot assume 
that each, speaker approached the story^retellmg task 
With the same enthusiasm, concern for length or syntac- 
tic variety - quite apart from the issue of English 
syntactic ability In another paper I will report on how 
length and clausal variety relate to situation by compar 




ing behaviors on the LAS with the same types of beha- 
viors in active competition' for the floor in peer group 
sessions. This is beyond the scope of the present effort. 

3.3 Further Observations on the Relation of Syntax to 
Morphology 

There are some final observations concerning syntactic 
complexity that are of interest. Table 3 below shows the 
number of speakers using each of the seven clause 
types at least once. 

Table 3 

n + (n) then so ithen)+ aft/whn if + rel 
Other cl cl +cl cl +cl d cl 



42 



39 



32 



18 



9 



19 



The RC (relative clause) structure is used by a little less 
than half the speakers, although it occurred three times 
in the source. Again, we nubt note that absence of the 
structure in the LAS does not necessarily mean lack of 
knowledge of the structure. However, an interesting pat- 
tern of use is observed by either AOA or AGE. Observe 
Figure 4, 
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0-&(N-28) 

10-11 

(N-5) 



6 8(N-7) 

11-12 

(N-16) 



9+ (N-7) 

12+ 

(N-21) 



Figure 4, Percentage of Speakers Using the Relative Clause 
Structure in the Story Retelling by Age of Arrival 
(solid line) and Actual Age (broken line) 

AOA shows a pattern of the depressed piiddle gcoup 
(AOA 6-8). This pattern is reminiscent of a pattern 
reported for several studies, but nost notably a Toronto 
study of immigrant students, by 3ummins (1981). In the 
Toronto study, a mixed batch of immigrants of ages 
T1-12 showed the f6llowing chc'ractenstics: using the 
Peabody Vocabulary Test (PVT) of English vocabulary 
development. Cummins notes that those who arhved at 
ages 2-3 showed more vocabulary development than 
those who arrived later However, those who arnved at 
ages 4-5 did not show more vocabulary development 
than those who arnved at ages 6-7 (in fact, they showed 
slightly less as a group). Thus, in that study, ages 4-5 is 
the depressed middle group with respect to the PVT 
since they do not show an advantage over peers who ar- 
rived at a later age Cummins suggests that there is an 
interdependence between vocabulary development in 
LI and L2 such that the more recent group (6-7) were 
able to trar^sfer the skills underlying vocabulary acquisi- 
tion in LI to'L2 and thus learn English vocabulary at a 
more rapid rate than the 4-5 AOA group, who one ^ 
would expect had a less developed vocabulary in L1 
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If a similar interdependence argument were adapted to 
explain the data of Figure 4. It would be that "some- 
thing" (an unclear factor) in the development of the 6-8 
AOA group's Spanish is either lacking or is not trans- 
ferred to. English, while for the later 9+ AOA group it is 
— and this "something" is responsible for the likelihood 
of using the relative clause in the story retelling."* 

However, the interaction of RC with age is just as strik- 
ing. The 11 -year-olds, as a group, use the RC much 
more often than either the 10- or 12-year-olds. Whatev- 
er the explanation, there is no incremental developmen- 
tal trend apparent, in contrast with morphology. 

Table 4 below shows th^t using wda as a diagnostic 
almost evenly divides those who used the RC and those 
who did not. ? 

Table 4 



This is in contrast to a language like Chinese (Standard 
Mandarin) in which the equivalent clause precedes the 
qualified noun and is marked by a particle (P) at its end: 

xudub (l^i Zhbnggub fSngwen) c/epengyou 
many come China visit P friends 
"many friends who come visit China" 

or Japanese, with a similar structure., but without any 
marker: 

(an^ta no t^beta) sakana wa 
you subj. ate fish topic 
"the fish (that) you ate" 



or Swahili. in which the equivalent clause follows the 
modified noun, as in English and Spanish, but the 
marker of the relative clase (RM) is embedded in the 
verb phrase: 





-l-RC 


-RC 


kitu (a-li-c/?okula msichana 








thing (she-Past-RM-eat girl 


-i- wda 


9 


12 


"the thing (that) the girl ate" 


- wda 


9 


11 





It IS clear that, for the age and grade'levels represented 
in this study, the syntactic and morphological measures 
derived from LPA tests are not comparable. Therefore, 
one should not expect any comparability of LPA instru- 
ments based on one and the other of these criteria. 

The evidence indicates that development of morphology 
is relativdy^traightforward for'preadolescents. depend- 
ing on lengtf\ of residence and/or exposure to Americanp 
schools.^ 



On the other hand, the development of syntax and its^ 
use in test situations are two separate matters. In solv- 
ing the problems of syntactic development and use. we 
will have to take into account situational variables of 
motivation as well as ability, and also the relation of the 
syntax of the first language to that of English. ^ 

In this context, we must note that the structure of the 
relative clause in Spanish is not appreciably different 
from that of Jnglish. owing to their common Indo- 
Europearr heritage. In both cases the relative pronoun 
immediately precedes the relative clause, e.g.. 

he didn't likeW/?a/(he drank) 

no le gustb /oTO6(comib 

Neg to-him pissed it-that (he) ate 

"he didn't Wke yifhat he ate"* 



The sintilarity of the English and Spanish relative 
clauses may allow easy transfer from one language to 
the other. The same kind of transfer potential is not evi- 
dent in morphology, e.g.. 

com-e: com-ib « eat-s: ate 
ca-e: ca-yb = fall-s: fell 
'^trobiera (Aux.-4- Perfect 4- Subjunctive)— would have 

We may expect that, except for those few cases of 
almost identical morphology, e.g.. the English and Span- 
ish regular plurals, the learning of English morphology 
is independent of language and shows an orderly pat- 
tern according to exposure (as reflected in our study in 
AOA). For syntactic structure, consideration of the first 
language is likely to be more important, but because of 
the apparent freedom of choice speakers have in select- 
ing devices, it is difficult to judge a speaker's syntactic 
abilities frorfj what he doesn'tsay in a test situation. 



^Further research will show that this/something" is not failure to have developed the relative clause structure in either language for 
any of , the speakers. 
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1 suspect Ihe stcond Plays a fairly minor role, but it is inseparable from the first in the data. 
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The Bilingual Education Project of the Ontario Institute 
forStudres in Education has been involved in evaluating 
French-immersion programs for the last ten years. The 
specific programs evaluated have been located in vari- 
ous parts of Ontario and share the following characteris-- 
tics: (1) they are optional: (2) they serve an English- 
speaking or majority language population; (3) the initial 
years of the program involve from one-half to a full 
school day in which instruction occurs in the medium of 
the second language; and (4) in principle, the curriculum 
content of the immersion program is similar to that of 
the regular English program. 

Canadian research studies on French-immersion pro- 
grams have, for the most part, been based on compari- 
sons^ of the academic achievement of immersion stu- 
dents with that of regular English program comparison 
groups. This emphasis can be explained by the need of 
educators to document that the educational experience 
of students in immersion programs was similar to that 
provided by the regular English program, and the desire 
of parents to be reassured in this regard. 

For the purposes of this paper, the question of academic 
achievement in such subjects as science or mathemat- 
ics is not podressed (but see Swain and Lapkin. 1981^ 
The issue o' the student's proficiency in their first lan- 
guage. English, is also not considered. Instead, the focus 
is on the second language proficiency: what is it? how 
has it been evaluated in immersion research studies? 
what kinds of conclusions can we draw from results to 
date? 

Before addressing these questions, it is important to de- 
scribe briefly two major French-immersion programs 
that we have evaluated. The most widespread immer 
sion alternative available in all ten Canadian provino 
IS the early total French immersion program. The lan- 
guage of instruction in both the half-day Kindergarten 
program and early grades is* French, with a daily period 
of English language arts being introduced in grade 2 or 
3. At grade 5, usually from 60% to 80% of the school day 
IS allocated to instruction in French, with this percent- 
age dropping to approximately 50% at subsequent 
grades At secondary school, early immersion foKow-up 
programs typically consist of about three subject op- 
tions (including French language arts) available in 
French for a total of 30% to 40% of the school day. 

"Late" immersion programs may begin in grades 6. 7 or 
8 The program evaluated b.y the Bilingual Education 
Project is located in Peel County (near Toronto) and 
begins at grade 8 with 55% to 70% of ^he instructional 



time in French. (Students entering the program have a 
background of short daily periods of French as a second 
language instruction in grades 6 and 7.) The intensive 
year of French at grade 8 is followed, at the secondary 
level, with several subjects available in French (history, 
.geography and French language arts) at grades 9 
through 11. In this particular late immersion program, 
no school subjects have been available in French at 
grades 12 and 13 except the traditional 40-minute 
French as a second language program in which post- 
immersion and non-immersion students have usually 
been combined in the same class. 

In the early evaluations of both early and late immersion 
program alternatives, the French language skills of the 
immersion students were compared to those of their 
peers in the regular English program who were studying 
French as a second language in 20 to 40-minute daily 
periods. In retrospect, it seems naive to have compared 
the two groups, since, to take a specific example, after 
one year of late immersion program. Peel County immer- 
sion students performed as well on tests of French 
listening and reading comprehension as regular pro- 
gram students studying French as a second language in 
daily periods who were two to three grade levels ahead 
of them. 

The comparison was also relatively meaningless in view 
of the objectives of French immersion programs with re- 
spect to second language skills. Educators and parents 
alike hoped that by the end of secondary school, immer- 
sion students (from both early and late programs) would 
be functionally bilingual. In 1974. the Ministry of Educa- 
tion for Ontario described, in functional terms, thek^- 
pectations for second language tise by immersion gracT^ 
es. It was expected that these students would be 
able to continue their education in French at^the post- 
secondary level, to accept employment where French 
was the working language and to participate easily in 
conversation. In other words, to a certain extent, the stu- 
dents' ability to speak, read, write and understand 
French would be native-like. ; 

i 

The relative comparison in assessing s^dond language 
skills would therefore best be made i/i relaltion to the 
native language abilities of francophone students of the 
sarrte^age and grade level. The extent to which we have 
been sljctressluMjvmaking such comparisons meaning- 
ful will be considerea^~after we have reviewed several 
tests and procedures used in the evaluations and the re- • 
suits obtained. \ 
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When immersion educations^vas first offered in Canada, 
there were no tailor-made assessment procedures (just 
as there were no suitable curriculum materials). One 
standardized French achievement tfesl normed on a 
population of native French-speaking students m Mont- 
real, had been developed by the Commissioh des Ecoles 
Catholiques de Montreal, This Test de rendement en 
Frangais proved useful in the French immersion 
context, because it provided native speaker data at 
each grade level against which immersion students' 
progress could be assessed. The test measures such as- 
pects of French achievement as identification of 
sounds, word knowledge (synonyms, antonynms, 
definitions), grammar (verb conjugations, number and 
gehder agreement, recognition of parts of speech), spell- 
ing and reading comprehension 

The kind of information yielded by using the Test de 
rendement en Franqais can be exemplified by results 
frorn its use m evaluating the early French immersion 
program By grade 1 or 2. immersion students score as 
well as about one third of their native French-speaking 
counterparts in Montreal. By grade 6, they score ^s well 
as one half of the Montreal comparison group. We have 
not analyzed the immersion test results diagnosticalty, 
that IS. to identify aspects of achievement m which their 
performance is relatively strong or weak m comparison 
to that of the French-speaking peer group 

To complement the information from such discrete- 
point achievement tests which place a distinct empha- 
sis on grammar we have designed measures intended 
to be more communicative. These include the Test de 
comprehension auditive and the Test de comprehen- 
sion de r^cntXo measure listening and reading compre- 
hension respectively. In developing these tests, an at- 
tempt was made to identify a number of real-life situa- 
tions in which immersion students might have contact 
with French speakers and to measure their understand- 
ing of the French used in each situation. Thus the listen- 
ing test consists of a number of radio tiroadcasts includ- 
ing news Items, sports items, weather forecasts, adver- 

; tisements and radio drama, each followed by one or 
• more questions with multiple-choice responses. Similar- 
ly. the reading test consists of a number of written 

' passages taken from newspaper and magazine articles 
and advertisements, comic strips, horoscopes, television 
schedules, recipes, and poetic or prose literature. 

We l^hink of these tests as integrative measures, testing 
not so much knowledge of a specific vocabulary item or 
grammar point, as the students* ability to derive mean- 
ing from the spoken or wntten text as a whole. Can they 
grasp the global meaning of such passages in the same 
way as they would quite naturally in their own mother 
tongue? 

A third integrative test used in the evaluations is a 
French cloze test, a wntten text from which every 
seventh word is deleted, with the initial and final para- 
graphs left intact to provide a context for the passage. 
This cloze technique is thought to provide an overall 
measure of proficiency since it reflects the students* in- 
tuitions and "educated guesses** about the second Ian- 
V> luaqe (Lapkin and Swam. 1 977. Oiler 1 979) 
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Two of the tests just described have been administered 
to early and late immersion students at grade 8, for 
example, and to bilingual francophone students at the 
same grade level. The results suggest that by the end of 
elementary school, the performance of early French- 
immersion students on the cloze test and the listening 
comprehension test is similar to that of grade 8 classes 
of bilingual francophones. Comparative statements in- 
volving francophone students must always be qualified 
in view of possible IQ, socioeconomic and other dff- 
ference between the groups. 

The use of these tests has also enabled us to compare 
early and late French immersion programs at grade 8 
when the late immersion students have had only one 
year of intensive exposure to French. As expected, late 
immersion students' scores are significantly lower than 
those of early immersion counterparts. In this case, the 
comparisons are statistical because we are confident 
that the relevant characteristics of the early and late im- 
mersion classes tested are similar as indicated, for 
example, by results from standardized IQ tests adminis- 
tered at grade 8 to both program groups. 

Since the first groups of early immersion students in On- 
tario have not yet reached the end of secondary school 
(at the time of writing they have complete(;i grade 10), it _ - 
has not been possible, to date, to compare late immer- 
sion results with those of early immersion groups at the 
end of their respective programs. With respect ta the 
question of how native-like the skills of late immersion 
students may be after several years in the program, we 
have been able to administer the listening test described 
earlier to bilingual francophone students at grades 10 
and 12. The results indicate that the bilingual francpph- 
one groups score significantly higher than the immer- 
sion students. From such results w0 infer that the late 
immersion students' performance after several years in 
the program is not yet native-like. 

Until this point, the test data presented have been in- 
tended to Hlustratethe kinds of comparative statements 
that can put immersion program results in perspective. 
With the exception of the cloze test, the tests that have 
been reviewed are measures of receptive skills. But how 
well can immersion students speak and write French? 
Further, the analyses reported are quantatitive. and no 
information has been presented on qualitative dif- 
ferences that may exist among the grou*" Moreover, if 
we express the goals of the immersion program in "func- 
tional" terms — that is. as noted earlier, that immersion 
should enable its graduates to converse easily in 
French, for example — what do we know about the pat- 
terns of French language use of the two immersion pro- 
gram groups? 

There are relatively few descnptive linguistic accounts 
of immersion students' spoken and written French. 
Those that are reported here relate almost exclusively 
to early French immersion programs. In one study of 
writing ability. Swam (1975) analyzed both English and 
French stories wntten by grade 3 immersion students, 
Companng the French stories of the early immersion 
students to their English stories, she found the French 

8/ 



stones to be shorter but equally varied in the use of 
vocabulary items (except prepositions); there were 
more gr^cnmaticat^rrors and fewer "technicar errors 
(in punctuatitm aruJ capitalisation), and more spelling 
errors The French storie^ also tended to be mere de- 
scriptive in nature than the English stories. It was clear 
that the French writing /of the immersion students re- 
jquired more attention, ^ince a second comparison be- 
tween English stories written by the same immersion 
students and regular 'program comparison students 
showed few difference$. * 

Using a French cloze test of the type described earlier, 
we also did a qualitative analysis of errors made by 
early immersion students at grade 5 relative to those 
made by francophone comparison groups, both bilingu- 
al and unilingual (Lapkin and Swam, 1977). The immer- 
sion students' scores were significantly lower than 
those of the unilingual compahson group. Some of the 
errors made were found among those made by unilingu- 
al French-speaking students. There was a striking simi- 
larity between the errors made by the bilingual fran- 
cophone students and the immersion students, suggest- 
ing that the immersion re^onses were qualitatively 
similar, and that the processing strategies used oy the 
immersion and bilingual francophone students might 
be similar 

Two studies of the total immersion students' spoken 
French have been conducted. The first (HarJey and 
Swam. 1977. 1978) involved interviewing five randomly 
selected grade 5 immersion students, three bilingual 
grade 5 children of French-speaking home background 
attending a francophone school, and three unilingual 
French-speaking children from Quebec City. The inter- 
views, conducted by a French-speaking adult who was 
not known to the children/were designed to elicit infor- 
mation on the verb system of the students. Among 
other things, the children were asked to narrate past ex- 
periences, explain current activities, and talk about the 
future: that is. they were expected to produce a variety 
of verb forms to realize specific semantic functions. A 
detailed linguistic analysis of the interviews revealed 
that the verb system used by the immersion students 
was simplified relative to that of the six comparison stu- 
dents. For example, immersion students were generally 
unable to produce conditional forms in French in re- 
sponse to such questions as: "Qu'est-ce que tu /era/5 si 
tu gagnais la lotene national?" Instead, the future might 
be used, thus removing the "hypothetical" nuance inher- 
ent in the conditional, or an adverb was used to convey 
tentativeness ("Je peut-etre acf^eier une voiture."). Thus 
immersion students were able to convey the essential 
message, but often used an inappropriate grammatical 
form to do so, The authors of the study concluded that 
there may be little need for the immersion students to 
acquire a completely native-like command of French: 
"Once the children have reached a point in their lan- 
guage development where they can make themselves 
understood to their teacher and classmates (as they 
clearly have), there is no strong social incentive to devel- 
op further towards native-speaker norms" (Harley and 
Swam. 1978:38) 
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IS important to bear in mind that the only sourcfe of 
itive French input in an immersion classroom is the 



teacher and that sustained interaction with francophone 
peers may be necessary if the immersion children are to 
attain native-like speaking abilities (Swain, 1978c). In 
another study by Szamosi. Swain, and Lapkin (1979). 
recordings were made of the conversations of a small 
sample of grade 2 immersion pupils who, on an individu- 
al basis, spent two hours each week for a total of eight 
sessions playing in the French-speaking homes of grade 
2 pupils who were attending a francophone school. A 
number of the play sessions were then transcribed and 
analyzed. The analysis was done in "functional" terms: 
that is, the purpose was to see how well grade 2 immer- 
sion students could use their spoken French in a natural 
context. It was found that the immersion students inter- 
acted with ease and naturalness and could joke, ask for 
clarification, issue orders, respond appropriately to the 
French-speaking playmate, and so on. The results of this 
study, along with those of the Harley and Swain (1978) 
study, suggest that early immersion students can 
convey a rich range of meanings in French, but that 
their way of doing so remains non-native. 

The "non-nativeness" of immersion students' spoken 
and written French can be described, but these descrip- 
tions do not address the question of how acceptable 
their spoken and written French is to the native French 
speakers with whom they wish to interact. One recent 
empirical study (Lepicq, 1980) has examined the ability 
of early immersion students at grade 6 to communicate 
effectively in French with bilingual and unilingual 
French students and adults. The study involves present- 
ing recorded speech samples from interviews with the 
immersion students and several unilingual French- 
speaking "control" students to a group of judges. The re- 
searcher interviewed the 96 judges (bilingual and uni- 
lingual, students and adults), to elicit views on the ac- 
ceptability of the speech samples. The judges tended to 
apply different criteria in evaluating the immersion and 
francophone students, demonstrating that their expec- 
tations differed for the two groups. In general, the im- 
mersion students were assessed favorably. Lepicq con- 
cludes that immersion students have sufficient ability in 
spoken French to communicate with francophones and 
identifies errors which are more and less irritating to 
native speakers in the speech of the immersion stu- 
dents. It is this kind of information that is important for 
curriculum planning and syllabus design, since empha- 
sis should be placed on those features of the second lan- 
guage which are considered most salient by native 
French speakers. 

A small number of questionnaire studies have been con- 
ducted to determine whether immersion students actu- 
ally put their French skills to use. Although relatively 
little self-initiated use of French (including listening to 
French on the radio, watching French television pro- 
grams, using French on vacations etc.) is reported, im- 
mersion students are willing to use French in interper- 
sonal communication. The extent to which French will 
be used in their adult lives in unknown, and it will be in- 
teresting some years from now to explore such ques- 
tions as whether immersion students choose employ- 
ment involvrng their second language skills to a greater 
extent than do non-immersion students. 
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The attempts to describe the second language proficien- 
cy of immersion students have been few* in part, be- 
cause Immersion is still a relatively recent phenomenon 
and because descriptive analyses of the sort reviewed 
above are time-consuming and expensive. The chal- 
lenge to those of us working in this area is to cooperate 



more closely with mother-tongue specialists (linguists, 
psycholinguists and sociolingulsts) so that theories of 
first-language development and information on aspects 
of first-language proficiency can be used to establish 
criteria against which second-language skills can be as-, 
sessed in meaningful ways. 
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Introduction 

A search of the literature regarding |c^rrTializedbfltngMal 
teacher training programs in insiituwps of higher edu- 
cation will reveal that the area of langtiage assessment 
of prospective candidates has remained a riqh arena for 
research and model development. Virtually no studies 
have been conducted on the estabHshment of 
entry/exit-level criteria and assessment practices in 
bilingual teacher training programs (recent research has 
appeared by Seidner, 19B1 a; Binkley Johnson, et al., 
1981). In contrast, an overwhelming emphasis of educa- 
tional research has been placed upon these issues for 
limited-English proficient populations in grades pr^-K 
through 12. For example: Cummins, 1979, 1980; Dulay 
and 8urt. 1980; Matluckand Mace, 1973 (also Matluck, 
1978); Estes and Estes, 1979; Maulden, 1979; Hardy, 
1977; Cornejo. 1978; Curtis, Lignon, and Weibly 1980; 
and 8alasubramonian, 1979, among others). We begin 
this study with an overview of issues as they pertain to 
the topic. This will be followed by an analysis and dis- 
cussion of relevant data from a national survey. Finally, 
the reader will be presented with models which pertain 
to the sample, conclusions, and recommendations. 

A number of earlier investigations implied the need for 
such criteria, through the incorporation of particular lan- 
guage competencies by teacher training programs (for 
example Sanchez, 1934; 8lanco, 1975; Andersson and 
Boyer, 1970; Cordasco, 1976; U.S. Commission on Civil 
Rights, 1973; Flores, 1973; Valencia, 1970; Also Viera, 
Squires, and de Guevara, 1975, also see Michigan 
Department of Education, 1976; Illinois Board of Higher 
Education 1974; Texas Education Agency, n.d; National 
Consortium of Bilingual Education, 1979). The impor- 
tance for considering the implementation of entry/exit 
level critena for language assessment was underscored 
in Gaarder's less than veiled warning that the majority 
of bilingual teachers were inadequately prepared for 
classrooms (1970) Based upon earlier analysis of exist- 
ing bflmgual programs, the federally sponsored Mary- 
land Conference of Bilingual Education offered a broad 



range of recommendations with implications for such 
criteria. Many of these recommendations were to 
^M;emain unimplemented during following years (cited by 
MeCaulay and Ramirez, 1 977). 

During"H9^-76, a survey conducted by the National 
Center for Eacrsalional Statistics found 218 colleges or 
universitiee offering ^ome^orm of bilingual teacher 
training. (Unfortunately, the second phase of this study 
as a comprehensive report remains on magnetic tape. 
For a discussion, see Waggoner. 1978). Basic federal 
grants during this fiscal year amounted to $53 million 
for 207 institutions of higher education. 

An analysis by the former Department of Health. Educa- 
tion and Welfare of Title VII teacher training programs 
revealed needs in areas of second language acquisition 
and bilingual subject methodologies (Department of 
Health. Education and Welfare. 1976; also Dissemina- 
tion and Assessment Center for Bilingual Education. 
1978). However, the analysis failed to address areas of 
language assessment and development for potential 
teacher candidates, especially in the establishment of 
entry/exit-level criteria. The impression left from the 
onset was one of colleges and universities mobilizing 
their efforts to obtain coveted federal funds, rather than 
formulating long-range training programs, including 
systematic criteria for entry/exit levels of target lan- 
guage(s) assessment. The importance of this theme is 
revealed to the reader apropos to the topic through the 
course of this paper. In recent years. MacCaulay and 
Ramirez. (1977) underscored the urgency to carefully in- 
vestigate the necessary requirements for teachers in 
bilingual education programs (also Ramirez. Gonzalez, 
et al.. 1974; Sutman. 1979; Arciniega. 1977; Castillo. 
1976; Carillo. 1977; Gonzalez. 1979; Rodriguez. 1980). 
As colleges and universities developed bilingual de- 
grees, two concerns relative tO this .exigency were 
identified; 1. Higher Education Institutions simply rela- 
beled existing courses as bilingual and regrouped them 



The preliminary data of this study was presented earlier in an overly abbreviated fonm, due to editing 
constraints, in Padilla 11981). Unfortunately, the author was not consulted regarding substantial editing 
changes. 
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into a slightly different pattern The inference, then, is 
one of a shor»-term economrc expediency perhaps 
taking Dre(.edHr»rR over a long range of moral commit- 
rTu»ru \u the ur^}(\T.in\ 2 Seldom were new programs 
hiring new tacuity to toach courses NA/hich were ap- 
propriate to bilingual teacher preparation Furthermore, 
potential chasms existed between suggested remedies 
offered by scholars, and the lack of implementation by 
colleges and universities Formulations by other researc- 
hers regarding target competencies for training pro- 
grams, articulated desirable linguistic skills, but also ne- 
glected to investigate the nature of entry/exit-level 
criteria in language assessment by institutions of higher 
education For example, the guidelines promoted by the 
Center for Applied Linguistics (1974) suggest a variety 
of competencies, but neglect to suggest a manner of 
implementation by cx)lleges and universities in estab- 
lishing points of entry and exit Similarly, competencies 
.jvere offered in a specific or more general fashion from 
researchers and organizations without ber>efit of an ap- 
propriate data base regarding ongoing practices by 
institutions 'Palmer 1975. Blanco. 1977; Martinez. 
1975. Castillo 1976; Brisk. 1978; Board of Directors. 
NYSUT. 1975. Bilingual Education Service Center. 
1977) Suggested competencies from Acosta and 
Blanco (1978) strongly recommended an exit-level lan- 
guage proficiency test which would include the evalua-J' 
tion of pedagogic competencies "in both languages" 
Inherent in the recommendation was some assessment 
process which would logically begin with potential stu- 
dents' onset OT studies, hence, the need for entry-level 
criteria In contrast; then, with its definitions and applica- 
tions to hmtted-English proficient students in public 
school, the terminology of entry/exit-level criteria for 
language asF.essment will be generally viewed for teach- 
er training institutions in terms of overall preparation 
processes. The two impdrtant aspects for our purposes 
of this viewpomt include the actual acquisition and 
manipulation uf target language skills in subject areas 
(contextual basis) Both features of remedial and con- 
textual basis Will be discussed m another subsection 
relative to suggested model characteristics (It is impor- 
tant to note that a lack of consensus m agreement 
arTKjng c:oileijf-i and universities pertaining to the estab- 
lishment of (criteria and assessment practices is evident 
with recent data from a study conducted by the RMC 
Corporation (Binkley. Johnson, et al . 1 981 ). 

Particular emphasis ts warranted regarding the nature 
of entry/exit -level criteria and quality of assessment pro- 
cesses, given recent studies on the relationships of 
teacher language to academic achievement (Merino. 
Pohtzer. and Ramirez. 1979; Merino and Politzer. 1977: 
Ramire?. 1979. 1980. Seidner. 1981b. also Bolger. 
1 967) It would follow that the lack of substantial critena 
levels and assessment processes by training institutions 
would '/jltimately have some degree of negative bearing 
on the overall quality 6f teacher language. We find sub- 



stantiation for this statement in the National Assess- 
ment Survey of Title VII ESEA basic programs which 
provided additional data through responses of surveyed 
teachers and directors (Dissemination and Assessment 
Center for Bilingual Education. 1978). The greatest inci- 
dence of need for teachers and directors corresponded 
at very high levels in areas of teaching content areas 
bilingually (Directors (n - 154). at 89; Teachers (n - 
521). at 245) and bilingual teaching strategies (Teachers 
(n « 534). 242; Directors (n » 153). 85). Consistent with 
these responses, the National Center for Education 
Statistics reported a majority of teachers who were 
rated as Spanish/English bilinguals in their national 
survey, with language skills not derived from profession- 
al training programs. Only approximately one third of 
this number (n -= 14.000) were rated as having crucial 
academic preparation to teach subject content areas to 
limited-English proficient students (Waggoner. 1979: 
also 1978; in addition, see National Advisory Council 
for Bilingual/Education. 1981). The National Center for 
Educational Statistics is currently sponsoring another 
nationwide survey, similar to the one of 1977. (On this, 
see Chamot. 1981.) An earlier study by Frank Ramirez 
(1974). found that 20% of bilingual teacher applicants in 
the Los Angeles Unified School District failed the lan- 
guage fluency test (also Migdail. 1976). A survey was 
conducted by the bilingual teacher training unit of the 
New Mexico State Department of Education the year 
before. It was found that 10% of their sampled 136 
bilingual teachers and aides could function at a third 
grade level when administered a reading and writing 
test in Spanish. None were able to show acceptable pro- 
ficiency at the fourth grade level (Pascual. 1 979). 

This brief overview of issues underscores the need to 
raise a number of questions, especially m regard to 
entry/exit-level criteria in language assessment, which 
- remain largely unanswered. The recent study conducted 
by the RMC Corporation provides us with a valuable 
general overview of current practices. Norietheless. no 
comprehensive data base has been assembled in re- 
sponse to some of the following concerns: Who makes 
decisions regarding the establishment of criteria for 
entry/exit-level language assessment in training pro- 
grams and choices of instrumentation? Who conducts 
the actual assessment of target languages? Are there 
relationships between the exoerience of decision 
makers in programs and choices? To what extent are 
public school, community sectors and faculty other 
than immediate program personnel included in the 
decision-rriaking process? To what extent are oppor- 
tunities offered to upgrade target language skills? What 
are the relationships between these opportunities to 
use target languages in educational experiences? To 
answer these and other concerns, we will continue with 
an examination of data from a national survey of bilingu- 
al teacher training programs in the United States. 




I w6Uld like t-; thank Dr Donna Johnson from the RMC Corporation 3nd Dr Richard Baecher from Fordham University for sharing 
this data 

•L^^tinqfT^iThilir.qajl teacher training institution^,, both public and private, from which the sample was randomly selected originated 
from compilations by government and other sectors (for example. Dissemination and Assessment Center. 1 979; U S ^f ^^Jl^^^^ 
Educatior. 1081 National Clearinghouse for Bilingual Education. 1980a, 1980 b. etc) A distinction is made m this study be ween 
•/istitutions and programs, smce a number of colleges and universities offer more than one on graduate and undergraduate levels. 
>"iee footrititn H of this paper for additional information 
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Overview of the Study 

A survey of 127 items was mailed to 93 randomly 
selected teacher training institutions tn the United 
States ^ The final report of this study reflects a return of 
69% (n-64). In addition, approximately 10% of respon- 
dents (n—9) indicated that their programs were disconti- 
nued, accounting for a total of 79% of the mailings. The 
percentage of return is considered more than adequate 
for potential analysis (See. for example. Sabbie. 1979; 
Shannon. 1948). Responses arrived from programs 
located in 23 states. Guam and Puerto Rico. The survey 
was addressed to Program Directors with provisions for 
collaborative answers by other program personnel. The 
final analysis showed that 90.6% (n«58). were complet- 
ed by Single individuals. The remaining 9.4% (n— 6) were 
completed by more than one person. Only 12.5% (n-8) 
of responses completed by single individuals were done 
by personnel other than the program director. General 
data describing years of program existence and the 
number of immediate ^acuity are listed below in Tables 
1 and 2 The mean number of years for program exis- 
tence was approximately five-and-a-half years. 



Table 1 
PROGRAM EXISTENCE 

ITEM n=64 •/• X s 



5391 2.646 

No of years 

1-4 22 344 

5-8 33 51 5 

9-13 9 141 



Table 2 
PROGRAM FACULTY 
ITEM n=64 % X s 

4.500 3.086 

No. of Faculty* 

1-4 35 547 

5-8 24 37.3 

10-18** 5 8 



•Immediate program personnel/full and part-time equivalents 
equated against individual institutional standards. 

••Reported numbers were: 10 Faculty (n«2). 12 (n-1). 15 
(n«"1), and 18 (n«"1). Secretarial and similar support person- 
nel were excluded. 

Program pnode for faculty was 5 employees for respon- 
dents (n«-ia 29.6%). Another 20.3% (n-lS) cited 2 
faculty members, while 28% (n— 18) responded with 3 
to 4 personnel (14% for each category). The mean 
number of faculty per program was four-and-a-half 
(given the marginal division of a human being). The 
large standard deviation (s) reflected the wide range of 
numbers away from X. In contrast, the sample from the 
RMC study revealed that the majority of bilingual 
courses were taught in most programs by the combined 
categories of 3-6 faculty members. The overwhelming 
number of personnel were drawn from the education 
department of colleges and universities throughout un- 
dergraduate and graduate levels, followed by foreign 
language sectors and the social sciences (8inkley. John- 
son, et al.. 1981). Of the 64 responses. 18 private and 30 
public institutions offered bilingual teacher training pro- 
grams for bachelor's degrees and undergraduate level 
credenttals. (See Table 3) 

Table 3 



BILINGUAL TEACHER TRAINING PROGRAMS 
OF RESPONDENTS* ln=64) 



Program mode for respondents was 5 years of existence 
(n«23). 35 9% Only a small percentage of respondents 
(n=«2) cited 1 year or less of existence for their 
programs The program mean for years of existence cor- 
responds with those from the aforementioned RMC 
study of 56 institutions. Prior months in compilation of 
their data account for the minimal difference in RMC 
and current statistical descnptions. This brings us to the 
number of faculty members serving in programs. Due to 
difficulties of interpretations of full-time and part-time 
equivalents, variations in definitions from one institution 
of higher learning to another, and differences brought 
about by semester, quarters and trimester divisions, im- 
mediate bilingual personnel were equated as such 
against individual institutional standards. (See Table 2 ) 



1. 2. 3. 4. 5. 

Assoc. Bachelor's Master's Post Doctoral 

Program {2 yrs.) Credentials 

•/. •/. % •/. •/. 



Private 

Institutions 

39%(n-25) 

Public 

institutions 

61%{n-39) 



64 
(n-4) 



32 
(n-20) 



52 8 
(n-33) 



32 
{n-20) 



43 2 
{n-27) 



48 4,8 
{n-3) (n-3) 



16 16.0 
(n-1) (n-10) 



•Colunnns 3, 4. 5 include progranns offered by respondents 
cited in Colunnn 2 (Bachelor's/Credentials level). As a result, 
the nunnber of progranns is apparently larger than the number 
of training institutions. Percentages (%) are based upon the 
sannpleof 64 responses (100%). 



^/^/eaknosses of questionnaire research have been pointed out by Rueknnick who grudgingly conceded the value of this approach in 
education Franzen and Lazarsfield (1945) concluded that the use of nnailed questionnaires yielded nnore data which was nnore relia- 
ble than interviews (also see Topp and McGrath. 1950. Phillips. 1951 Conrad 1 960. Young. 1966, Rosenberg. 1968. Babbie. 1979) 

*A study by See (1957) relates that a greater nunnber of questionnaire returns were obtained when the original request was directed 
Y o the administrative head of an organization 
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Three of the respondents representing private institu- 
tions cited current teacher training programs only on 
master's and doctoral levels (Another private institution 
responded with a Master's in Education program). One 
public institution claimed only a master's program 
while two referred to master's and doctoral programs. 
There were no discernible patterns which emerged in 
pnvate as opposed to public training institutions. Of the 
responses. 65 5% of the sample (n-42) were subsidized 
by federal and college/university funds Only 12.5% re- 
ceived exclusively federal funding (n-«8) In comparison, 
3.2% (n-2) were completely funded by the respective 
college/university Another 16% (n«1) were completely 
subsidized by the state The remaining 17 2% (n-1 1) re- 
ceived funds from other sources (e.g. combinations of 
the state/federal, state/college, university/private 
sources, etc ) Private institutions characterized the last 
three categories, while the majority of public institutions 
offering undergraduate programs typified that of exclu- 
sive federal funding. The majority of programs receiving 
exclusive ur combined categories of federal funding and 
colleges/universities were situated in states with special 
requirements for bilingual certification, the largest, per- 
centage of respondents, including California (15.6%). 
Texas (10 9%). Illinois (10 9%). New Jersey (7.8%). and 
New York (15 6%) Although New York State lacks a par- 
ticular mandate, the majority of the responses arrived 
from New York City which conducts bilingual licensing 
of personnel 

As part of our overview, the 64 respondents reported 
the following categories of langauge assessed m their 
training programs (See Table 4) 

Table 4 

LANGUAGES ASSESSED BY PROGRAMS 



CATEGORY n=64 



Cum. Freq. % Mode=3.000 



42 65 6 65 6 
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1 English Only 1 16 16 

2 NonEng Lang 13 20 3 203 

3 Eng arid ono 
other target 
language 

4 More than one 
non English 

target language 4 6 3 6 3 

5 None 4 6 3 6 3 



The none (no assessment procedures) -category included 
a Chamorro English training program and a private insti- 
tution receiving no federal funds. Although category 3 
reflected an overwhelming inclusion of Spanish, other 
languages such as Yupik were included m the data tabu- 
lation The majority of the institutions receiving exclu- 
sive or partial federal funding, assessed one target lan- 
guage in addition to English in their bilingual teacher 
training programs The second largest category, the as- 
sessment of one target language exclusive of English, 
reflected a majority of programs receiving some degree 
of federal funding This brings up a question, how did as- 
sessment procedures of programs with more than twg 



target languages differ from those with only two? Dif- 
ferences were apparent in assessment approaches, pro- 
cesses and choices of instruments by programs with' 
more than two languages. In brief, the analysis of the 
data shows an increasingly heterogeneous pattern of as- 
sessment approaches for programs with more than two 
languages. More observation, interview and question- 
naire techniques were employed in combination as the 
number of target languages increased. There was a 
greater reliance upon sectors other than the immediate 
program with the increase in languages, as well as the 
participation of public school personnel and community 
resources. It is interesting to note that the RMC sample 
revealed a smaller enrollment and reduced curriculum 
for target languages other than Spanish and English. 
These developments may be reflective of an overwhelm- 
ing emphasis of funding, resources and program devel- 
opment for Hispanic ethnolinguistic groups. Having pre- 
sented a general overview, we will now turn to some 
other specific aspects of the study, appropriate to ques- 
tions raised in the introduction. 

DECISIONMAKERS 

Who makes the actual decisions in determining 
entry/exit-level criteria for assessment of target lan- 
guages? Only 18.8% (n-12) of the respondents cited 
the director of the program as the sole decision maker 
(It is appropnate to note that the RMC investigation 
found that 28% of their sites reflected one or two key in- 
dividuals who significantly affected the operation of the 
program). The largest incidence occurred with 40 6% of 
programs reporting decision making by immediate 
faculty. The next highest incidence was found at 26.6% 
of programs in which bilingual faculty from sectors 
other than their immediate program (e.g. foreign lan- 
guages, language testing centers, etc.) were involved. 
The remaining 14% consisted of shared decision 
making by these others in conjunction with program 
directors. (One program which reported no assessment 
procedures responded with the director as decision 
maker in a hypothetical situation of near-future need.) 
Only 15.6% (n«10) of the respondents reported better 
than average participati(;/n c5f public school personnel in 
their decision-making process other than English. At 
the same time 15.6% (n-10) reported average participa- 
tion. Finally. 54.6% (n-35) reported no participation, 
while another 11% cited involvement of public school 
personnel as low (based upon a total n«64 with 2(3.2%) 
missing cases). A similar pattern of responses evolved 
for the processes of involvement in establishing English 
language criteria. 

Setter than average community involvement (very 
high/highest ratings) accounted for 25% (n*16) of the 
respondents; 28 1% (n-18) reported average 
participation. A remaining 17.2% (n-?11) claimed low 
participation, while another 20.3% (n^ 13) reported no 
involvement; 9.4% (n«6) oi the 'ca^es were incomplete 
for consideration. The perceptually higher percentage 
of community may be explained, in part, by the exis- 
tence of advisory boards from a number of institutions. 
Advisory boards, however, may also be mostly token 
representatives of participation in order to meet any 
funding commitments. Some writers m the field have 
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suggested benefits of community involvement in train- 
ing programs (For example. Brisk. 1978; also implica- 
tions from Resnick. 1976; Carrillo. 1977; Barrera. 1973; 
De Inclan. 1976. and Pascual. 1979). It is interesting to 
note that 15.6% (n«10) of the target programs reported 
a better than average student participation in establish- 
ing language assessment criteria for entry/exit levels. 
Some 21 9% (n-14) cited average, while another 20.3% 
(n-13) were low. 40.6%(n>-26) responded with no par- 
ticipation and 1.6%(n-1) of these cases failed to 
respond. Student participation was higher than antic- 
ipated and may be due to opportunities of internships, 
scholarship and fellowship roles. This brings us to the 
question of who conducted the actual assessment of 
target languages? 

Less than 10% of program directors unilaterally con- 
ducted assessment of target languages. The incidence 
constituted less than half of the respondents reporting 
unilateral director decision making regarding entry/exit- 
level criteria This type of assessment conduct was 
found in programs characterized by two target lan- 
guages, less than the reported mean number of years of 
program existence. Some association was found be- 
tween assessor(s) of languages and the decision makers 
for entry/exit-level criteria (p<.05. rg-.38). nonetheless, 
not high enough to make an absolute judgment. The lar- 
gest number of respondents occurred where assess- 
ment was conducted by all bilingual program faculty. 
(See Table 5) 

Table 5 

CONDUCTORS OF LANGUAGE ASSESSMENT 



interdepartmental relations arose between bilingual 
and foreign language sectors (Binkley. Johnson, et al.. 
1981). This was attributed to the incompatability of phi- 
losophies regarding the importance and nature of stu- 
dent proficiency in target languages and/or competition 
for student populations. In support of this premise, only 
8% of Amehcan colleges and universities required a for- 
eign language for admission during 1979. compared 
with 34% in 1966. Federal support for foreign language 
programs during this year (only 10%) was lower in real 
terms than in the 1960's. Private foundations have with- 
drawn almost completely from the field (President's 
Commission of Foreign Language and International 
Study. 1979). As a consequence, potential struggles be- 
tween bilingual and foreign language sectors may tran- 
scend real issues of economic need and surface as the 
symbolic control of testing processes. This control 
would portend eventual funrieling of students into one 
series of courses by one sector as opposed to another. 
Respondents for this study who indicated a large 
degree of institutional control over target languages 
mentioned such instruments as the M.LA. test and the 
TOEFL which are popularly utilized by Foreign and En- 
glish Language sectors. However, it was difficult in 
other conditions of control over assessment processes 
as reported by respondents, to distinguish between 
individual, program and institutional choice since a high 
degree of probable coincidence existed. 

Given this pattern of involvement, what were selection 
choices of decision makers regarding assessment? In 
ascertaining entry/exit-level criteria for English and 
other languages, respondents chose the following kinds 
of assessment for program preference. (See Table 6) 



CATEGORY n«62* % Cum.FrBq.% Mode«5.000 



1 


Program Director 


6 


94 


9.4 


2 


AM Bilingual 
Program Faculty 


22 


344 


355 


3 


Some Bilingual 
Program Faculty 


4 


63 


6 5 


4 


Faculty from 
Other Sectors 


7 


10 9 


113 


5 


Bilingual and 
Other Faculty > 


23 


359 


37.1 



•Two missing responses refer to two programs reporting no 
formal assessment processes. * 



The second greatest occurrence consisted of bilingual 
and other faculty. Respondents largely identified "other 
faculty" in terms of personnel from English language 
centers and institutes. A number of respondents (6.3%) 
mentioned community resources participating in the 
actual assessment processes. Another 3.1% reported in- 
volvement of public school personnel The brief occur- 
rences were mostly found m programs reporting more 
than two target languages. Faculty from other sectors 
reflected involvement from foreign language depart- 
ments and English language centers. The domination of 
assessment processes for students in bilingual pro- 
grams by these sectors may be a considerable concern, 
^^^.scertained from a number of comments volunteered 
sJXJy respondents. We also ftnd evidence that the poorest 



Table 6 

PREFERENCE FOR LANGUAGE ASSESSMENT 



H. Other 

1. English Adj.* Lang. Adj.** 

ASSESSMENT (n«64) % (n-62) (n«62) % (n«61) 



1 


Commercial 
Instruments 


9 


14 1 


14 5 


6 


9.4 


9.8 


2 


Noncommercial 

Field-Tested 

Instruments 


4 


6.3 


65 


4 


6.3 


64 


3 


Informal lr)ter- 
views 


5 


7 8 


81 


3 


47 


49 


4 


Obsfirvations 


2 


3 1 


32 


2 


31 


3.3 


5 


Other"* 


42 


656 


677 


46 


71 9 


754 



*2 cases Missing Mode I ^ 5.000 
••3 cases Missing Mode I) - 5.000 

***lncludes combinations of the above or other instruments 

Compared to these responses, the RMC preliminary 
data indicated that the interview was the most often uti- 
lized by their sample for target languages other than En- 
glish. Category Other reflected a majority of combina- 
tions which included informal interviews and/or obser- 
vations. The largest incidence of program preference in 
the higher than average categories for languages other 
than English (n-37. or 57.8%) also occurred with inter- 
view scales. Comparative data from respondents for dis- 
crete point tests show a higher than average program 
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preference of 47.4% (n-27). and a 34.4% (n-22) for 
criterion-referenced tests. Questionnaires rating 22.1% 
(n-13) patterns of program preference were somewhat 
similar in the assessment of English. Some indicatiori of 
particular use of instrument emerged, such as the FSI 
and the M.LA. test (on these and other instruments, see 
Clark. 1978; Oiler and Perkins. 1980). The contention 
that these patterns may result from locally developed or 
state developed procedures finds some strength in cor- 
roboration in RMC data. The RMC study further found 
that their sample failed to establish specific criteria for 
assessing student's language proficiency, accordingly, 
students were expected to be fluent in target languages, 
although no definitions were established in regard to 
the nature of fluency. Most programs saw no need for 
English requirements beyond those of the university as 
a whole (since the majority of students were educated 
in the United States and considered fluent in English). 
Assessment of students at exit levels occurred more fre- 
quently before their graduation from institutions in 
states with certification requirements in bilingual edu- 
cation. The reverse occurred in states without certifica- 
tion requirements. No general pattern appeared among 
respondents of this study among states with certifica- 
tion requirements. The use of a measure such as the FSI 
appeared stronger in one state, for example, while it 
was weaker as a recommended approach in another. A 
forthcoming discussion will provide a recapitulation 
and summary of these and other points. . For the 
moment, we will examine relevant issues of remediation 
provisions and opportunities to utilize target languages. 

Turning to concerns regarding availability of remedia- 
tion courses in target languages, we find that only 3.1% 
(n-2) of respondents ranked available opportunities in 
English as low or none, as compared to cumulative 7.8% 
(n-5) for other languages. Another 67.8% of respon- 
dents (n-43) reported better than average opportunities 
for English language remediation, in comparison to 
62.7% for other languages. 

The highest opportunities for utilizing languages in aca- 
demic settings went to English in contrast to other 
target languages (84.24% (n-54) in English compared 
to 46.8% (n-30) for other target languages). In order to 
ascertain relationships of language remediation oppor- 
tunities to those of utilization, both Kendall's Tau 



2 N(N-^)-l^ 



/V(/V~1)-T 



and Spearman's r-, (sj 



Rho r ^ 

s 



T + T 

X Y 



1/2 



were adopted as rank order coefficients. The level of 
, confidence p<;05 was selected a priori. Spearman's Rho 
yielded a closer approximation to Pearson's product- 
moment correlation coefficients as the data was more 
or less continuous Although Kendall's Tau is usually 
more meaningful with a larger number of tied ranks, a 
D I /^"Correction for ties overcomes the potential shortcoming. 



Table 7 \ 

RELATIONSHIP OF LANGUAGE REMEDIATION 
OPPORTUNITIES TO LANGUAGE UTlLIZATfbN 



RANK/TIES 


N 




, sig.* 


7 


sig.* 


English Language 

A. Opportunities 

B. Use \ 


60 


.3574 


.048 


io46 


.021 


Other Languages 

A. Opportunities 

B. Use 


5& 


.3612 


.006 / 


' .3187 


.001 



•p<.05 

Given the levels of confidence, a substantial degree of 
correlation is apparent between English/other language 
remediation opportunities and tHjpir counterpart ties for 
utilization. (See Table 7 above.) 

Levels for remediation opport'/iuties. r^iked from r me 
through highest, paralleled those for utilization of tc.get 
languages ir> subject cl^sses^ Opportunities for upgrad- 
ing language proficiency rnatenalized in t1r.e following 
manner. Remedial coursework was provid^ in the 
target language to upgrade competencies. Subiect-arda 
coursework was presente)fl in target languages. Pi^ld ex- 
periences were provided for students with the oppor- 
tunity to utilize target languages. Programs which\x- 
ceded two target languages, appeared to rely less upon 
utilization of target languages in formal cidss surround-^ 
ings and more upon /field experiences. The data sub- 
stantiates the conterttion from the RMC study that the 
utilization of languages in classrooms depended largely 
upon the instructors facility with the languages and 
with the number of/target languages in the progranri. As 
the number of ifarget languages increase, across- 
the-board opportunities for remediation seemed less 
than provided bWprograms with only two. It is appropri- 
ate to note that 84% of all programs in the RMC sample 
required their students to take one or more remedial lan- 
guage courses; Another 26% of the programs recom- 
mended coursiwork. but did not require the student to 
take it. As the academic level (i.e. masters, doctorate) of 
the program increased, the number of required courses 
was found to cJecrease. Given the nature of this study, 
this author's survey could offer only general substantia- 
tion of this tendency in this instance, rather than a 
definitive statistical comparison. No pattern appeared in 
the preliminary ^tabulations to indicate correlations be- 
tween opportunities for remediation and types of as- 
sessment instruments. However, a higher correlation ex- 
isted between the nature of the ultimate decision 
maker(s) for establishing language assessment criteria 
and opportunities in utilizing target languages other 
than English language skills (n-57, r^ - .5459, p<.05, 
reported sig. .001). The hifih incidence of group decision 
making (re. the immediate program with outside facul- 
ty), corresponded with high occurrences of opportuni- 
ties for utilization of these languages. 
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Experience, Academic Preparation and institutional 
Commitment 

An analysis of data shows the directors of programs 
averaging a little uv«r 7 years of teacher training experi- 
ence on college and university levels Interestingly, the 
average number of these years in bilingual teacher train- 
ing for directors was less than 5 1/2 years. Throughout, 
the standard deviation indicates a rather large distribu- 
tion spread regarding experience and academic prepara- 
tion Overall, teaching m public schools for directors 
averaged less than 5 1/2 years, of which less than 5 
were in bilingual education. (See Table 8.) In compari- 
son, faculty in bilingual programs averaged less than 5 
1/2 years in teacher training. (Note mode of 2 years 
overall experience and only 1 year in biliagual teachSr 
training ) Teaching m public schools averaged about 2 
years for faculty, of which most expenence was in 
bilingual education. (Here again, the mode for general 
teach'ing was 1 year compared to 1/2 year in bilingual 
education) 

Table 8 i 

ACADEMIC AND EXPERIENTIAL BACKGROUND 
OF SAMPLED PROGRAMS (n=64) 



Directors 



Program Faculty 



CRITERIA 


X 


5 


Modt 


X 


s 


Mod« 


Teacher Training 
fn-64) 


7001 


4110 


5000 


5 377 


4 388 


2000 


Bilingual Teacher 
Training (n— 64) 


5401 


4414 


2 000 


2 4896 


3 369 


1 000 


Teaching (Public i 
School) (n-60) 


5216 


5213 


1 000 


2 121 


5912 


1 000 


Bilingual \ 
Teaching (n— 60) > 


4 704 


5910 


1 000 


1 997 


4 274 


0050 


Academic Prep 
(n-64) 


5 252 


4012 


5 000 


4 120 


0 062 


2 600 


Assessment Univ 
Level (n-64) 


4 587 


3465 


4 000 


1 5202 


2221 


1 000^ 


As^«■;ssmenl Public 
School ln-62) 


i782 


3 552 


1 000 


1 821 


5 122 


1 000 



In terms of academic preparation, directors averaged 
less than 5 1/2 years of formal (e g college, university) 
and informal (e g workshops, seminar etc ) training in 
second language acquisition and language assessment 
subjects areas Faculty compared with a little Over 4 
years As for an actual language assessment expenence' 
in college and university training programs, directors 
averaged a little more thai" 4 1/2 years compared to a 
little more than a year-and-a-half for faculty Language 
assessmerU experience in public school averaged a 
little less than 3 years for directors and a little more 
than a year-and-a-half for faculty ^ 



Data from the RMC sample reveals that 90% of the pro- 
gram directors and 85% of the educatior) faculty have or 
had at one time a teaching credential This in itself does 
not denote any large-scale experience on behalf of 
these individuals since such credentials in most in- 
stances can be obtained upon graduation from an ac- 
credited teacher training institution. Data from the RMC 
sample^indicated the larger numbers of program faculty 
with doctoral degrees. (Of the doctoral specializations, 
js23% were in c,Urriculum and instruction, another 15% in 
other educational areas, and 24% in language and 
linguistics.) About half the faculty had experience in a 
classroom with limited-English proficient students. Less 
than half of either group had experience of any sort in a 
bilingual classroom. Unfortunately, doia from the RMC 
study regarding exact numbers of years experience 
were not available for specific comparisons with this au- 
thor's sample. However, in genera', the patterns of both 
studies appeared to corroborate each other as far as 
general trends. 

The ramifications of the aforementioned data are such'' 
to suggest that current faculty may have much Mess 
overall teaching experience in public school sections, 
as well as less specific experience with limited-English 
proficient students, than actual teachers in public 
schools. Doctoral degrees might not offer a substantial 
substitution since, as one of my eminent colleagues so 
eloquently put it. the trainers of the teachers and the 
professionals in the college or university, who write 
about limited-English speaking students and offer 
potential directions, "may never have seen a kid in their 
entire lives within a classroom setting." The situation is 
further compounded by above-cited evidence which 
shows possible reluctance and/or inability of colleges 
and universities to involve experienced personnel from 
public schools in the decision-making process. The 
argument has been advanced that the relative "new- 
ness" of the field has, in essence, shaped the dilemma 
(Seidner and Seidner, 1981. b; 1982). We will attempt 
to offer some insight through our data based upon ap- 
proaching the issue of institutionalization. Henfy Pascu-. 
al (1 979) reminds us of our enthusiasm to 'apply severe 
criticism to public school programs," or pressure upon 
publishers for better materials. Yet, we somehow ne- 
glect to question colleges and universities which 
remain sacrosanct and appear to "do no wrong." 

Inherent in this statement is the ultimate concern of in- 
stitutional commitment to programs and to the actual 
philosophical premises of bilingualism and bilingual 
education, For purposes of this paper, two particular 
concerns have been identified which relate directly to 
this study: a commitment of resources to the program, 
and b commitment to continue the program beyond 
the span of time dictated by funding sources other than 
from the university. Obviously, both concerns will ulti- 
mately influence the establishment and continuity of en- 
trance/exit criteria for language assessment. 



'The survey queried respon(ients along similar lines of related experience for other involved facujty and public school personnel 
The returns were too mimscMe to make any type of overall judgments The small return indicating involvement from public schools 
in the assessment process yielded the following completed responses (n«6) Average no years bilingual teaching 4*4 1/2 yearc 
^^9^.40%). and 6 years (20%) 
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As sfiown earlier a miniscule percentage of respondents 
indicated the total support of programs by the respec- 
tive institution. On the surface, some commitment of in- 
stitutional resources appears with the observation of 
the RMC sample that the proportion of program direc- 
tors and faculty funded partially or entirely from institu- 
tional sources was quite high (Binkley, Johnson, et al., 
1981). A secondary observation was the acknowledge- 
ment of other bachelor's and master's level programs 
alongside Title VII funded programs in at least ten insti- 
tutions. It would be a naive assumption that colleges 
and universities are in the education business for sheer 
altruism.® An examination of a number of training pro- 
grams, which claimed some or all absorption of the 
director's and faculty salaries, shows various distribu- 
tions of funding among student reimbursements (tui- 
tion, materials, etc.). faculty costs (including trips), in- 
stitutional overhead, and other operational costs. If Title 
VII funding were to cease, would training institutions 
continue with commitment of programs? Given the 
data in Table 7. how significant are institutional pay- 
ments of directors' salaries in part or whole? 

Will this apply to other program faculty? The RMC 
study offers the beliefs of interviewed directors that the 
institutions would consume funding programs beyond 
fiscal expectations. How valid are these kinds of remarks 
from respondents who are seeking funding, when inter- 
viewed by quasi-representatives of the ultimate funding 
sources?/ An important piece of the puzzle was cited 
earlier, namely the 15% of respondents who took the 
time to indicate that their programs were no longer in 
existence. Another study would be welcome to examine 
the extent of this phenomenon, in relation to particular 
sources of funding and institutional commitment. 

Another important characteristic of institutional com- 
mitment would appear to be one of retraining. If institu- 
tions rely upon personnel from departments other than 
bilingual, claim a shortage in trained professionals 
- (Binkley. Johnson, et al.. 1981) or stress the relative tnci- 
piency of such training programs, what is the 
commitment then, to provide adequate staffing? Part of 
the answer may be found perhaps through inservice 
retraining. At least one natiorral-t^sk force recommend- 
ed a locus of support^fqr jnser\(ice trairimg in colleges of 
education (Kersh. 1978). No evidence has surfaced to 
substantiate an independent initiative or commitment 
of this nature. In fact, a recent study^-subsidized by the 
American Association of Colleges for Teacher Education 



examined the commitment of 17 institutions to inser- 
vice education. The potential position of the Office of 
Bilingual Education and Language Minority Affairs in 
securing funds and credentials was deemed a key factor 
in institutionalization (Carey and Marsh. 1980). Addi- 
tional light may be shed on the nature of commitment 
(or lack therof) by examining the actual profile of im- 
mediate faculty within bilingual programs. 

As shown above, an analysis of data collected from Our 
sample revealed that directors of programs are the most 
experienced personnel. A further examination supported 
the premise that a large number of individuals were ens- 
conced in areas other than bilingual at the onset of Title 
VII funding. However, the mean years of program exis- 
tence exceeds the mean years of faculty teacher training 
experience. The large distances between directors and 
immediate program faculty suggested a lack of long- 
range commitment by training institutions to maintain 
the latter. Additional information from the RMC study 
shows that only 31% of all program directors had 
tenure, while another 34% were in tenure-track posi- 
tions. The remainder were employed along non-tenure 
lines. Over 60%. of faculty in bachelors/credentials pro- 
grams were on tenure-tracks, the percentage increasing 
(for directors as well) as the academic level (i.e.. mas- 
ter's, doctoral) increased. The citation of close to 60% of 
the tenure-track individuals, as tenured, points to their 
incorporation in definition from other departments as 
immediate program faculty. (Otherwise, the inconsisten- 
cy of mean years for program existence and the average 
of seven years for tenure would remain uriresolved.) The 
highest overall evidence of academic rank for directors 
occurred on assistant and associate professor levels, 
compared to instructor and assistant professor for facul- 
ty. In short, university hiring and retention policy may 
be a critical factor affecting alternative choices for 
selecting processes and entry/exit-level criteria for lan- 
guage assessment of potential bilingual teachers. Per- 
haps any reluctance vis-a-vis commitments by institu- 
tions is due in some part to the relatively short nature of 
existence of bilingual training programs. Colleges and 
universities seem to favor a philosophic framework, ex- 
pressed by one institution as the incorporation of faculty 
which are "academically respectable and appeal to the 
scholarly" (Burke. 1972: See also Broudy. 1962: and 
Weber. 1972; Arends. 1973). Although the goal may be 
praiseworthy, the interpretation of terms by a select 
guardianship might at least prove troublesome. This is 
especially so for a relatively new program in its quest 



h'he Carnegie Council on Policy Studies in Higher Education predicts that enrollment will fall even more than the size of the historic 
college-age cohort of 18-21 (1 980). and colleges and universities will compete for even more scarce students in destructive way 
lr)Stitutions of higher education ultimately are seeking other sources of funding to venture beyond, what may be considered in 
numerous incidences, survival levels (Kramer. 1980: May hew. 1979: U.S. National Center for Educational Statistics. GPO Annual 
(1979. 1980); Academy for Educational Development. 1971: Carnegie Council on Policy Studies in Higher Education. 1979). 

number of respondents from the sample when spoken to after the survey in an informal setting expressed fears that the institu- 
tion would not continue. . the programs beyond the expiration date of federal programs. They describe their respective institutes 
' with such descriptive terms as "cannibalistic." and "prostitute." Also consider Franzen's and Lazarsfeld's study on conditions and 
validity of interviewing. 

^During fiscal year 1980-1981. schools of education at 27 institutions received over $1 million in Title VII funds to design first-year 
training programs. One hundred arjd thirty colleges/univfersities and 38 other applicants received $18.5 million to improve teacher 
training programs and to prepare bilingual personnel. Forty-four colleges and universities received more than $4.5 million in fel- 
lowships to post-graduate students. Twelve universities, four school districts and three nonprofit organizations received $9 million 

^ »o operate 19 bilingual education service centers. Eight universities, four school districts and five materials development centers re- 

Y 3ived $6 5 million to develop curriculum materials (U.S. Department of Education. 1981). 
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for academic credibility, which seeks to withstand the 
test of time. The factor of continuity is obviously critical 
and portentously mortal for the implementation of 
these processes and contingent upon institutional com- v>:^ 
mitment of resources and maintaining experienced*^ 
personnel. * 

Discussion and Summary: / 

Before proceeding to a description of our mocf^TsTin 
brief summation: 



1. 



2. 
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A pattern of decision making emerged with a sub- 
stantial though modest number of directors 
making decisions for entry/exit level criteria in lan- 
guage assessment. As predicted, involvement of 
public school personnel would be minimal. 

One of the expected outcomes was the larger rate 
of participation of community personnel. Also as an- 
ticipated, the largest incidence occurred with pro- 
grams sharing deci^ons between director and 
bilingual faculty, followed by combinations with 
faculty from other sectors than the immediate pro- 
gram. An interesting high correlation between stu- 
dent and community' participation was an expected 
outcome (n«63, r^ « .409, p<.05). 

An analysis of data revealed a tendency for director- 
made decisions to modestly correlate with a prefer- 
ence for commercial instruments, particularly 
|discrete-point tests in other languages (n=51, r ^ 
.3634, p<.05). As predicted, assessment conducted 
by faculty members from sectors other than the im- 
mediate bilingual program showed similar selection 
of instruments for English and other target lan- 
guages. Selections of other combinations of instru- 
ments, including observation measures and meth- 
ods, were favored by respondents with shared deci- 
sion making which involved combinations of 
sectors. 

It jwas hypothesized that bilingual faculty would 
halve a low number of years of experience in the 
public sector. Unexpectedly, an analysis of data 
showed a significant difference between experience 
in public schools, colleges/universities and academ- 
ic preparation between directors of programs and 
bjlingual faculty the differences were large enough 
to lend to speculation as to the nature of institution- 
. aj commitment to bilingual training programs. 

Significant data regarding the period of time of pro- 
grams' e)^istence and the number of years of teacher 
t aining experience for bilingual faculty raise a 
r umber of serious issues regarding the intentions 
ty colleges^and universities to maintain programs 
£nd retain faculty. Ultimately, decisions by institu- 
tions on commitment of resources and retention of 
(jxperienced faculty will effect the nature of 
ijntry/exit-lever critena and assessment processes 
n target languages. A secondary source of data in 
support of these issues emerged from a 15% return 
/vhich claimed discontinuation of programs by col- 
leges/universities due to the cessation of Title VII 



funding. It is questionable whether programs will 
continue on instifiitional funds folfowing termina- 
tion of Title VII funding. 

/ 



•A-^trong oorrelaticn wds obs^rve^'.between oppor- 
tuVtities for remediation and those for language utili- 
'zation. However, aVdistinction in entry/exit-level 
criteria was obvious\in language assessment for 
remediation purposes and for language in the con- 
text of academic subject ^(eas. 

Director-made decisions supported a trend av 
from contextual assessment\emphasizing remedia- 
tion. A unilateral emphasis upon remedial assess- 
ment, as hypothesized, was observed with decision 
and assessment processes being made by faculty, 
other than bilingual, for English and foreign 
languages. 

5. Although not uniform, to provide a general pattern, 
the data revealed some degree of influence, by a 
number of states with certification requirements in 
bilingual education, upon the assessment practices 
of respective respondents. Ip one of the most con- 
cerned and flexible of thes6 states, representatives 
from bilingual training programs served on an ad- 
visory board to recommend assessment procedures 
and certification requirements. (Illinois Association 
of College and University Educators in Bilingual Pro- 
grams, 1 981 ; Seidner and Seidner, 1 981 , a; 1 982). 

6. The highest percentage of respondents cited as- 
sessment of target languages conducted by all 
bilingual faculty followed by bilingual and other 
faculty, supporting the contention that sectors 
other than the immediate bilingual program within 
the university influenced assessment processes. 
Less than 1 0% of program directors unilaterally con- 
ducted program assessment. Instruments selected 
by foreign language departments and English lan- 
guage centers were mostly commercially produced. 
The utilization of commercial instruments suggest- 
ed a potential subordination of program objectives 
and entry/exit-level criteria to that originally estab- 
lished in the construction of the instruments. In a 
few instances, the assessor for target languages 
was drawn from the community and/or public 
school. 

7. As originally hypothesized, a greater heterogenous 
approach to assessment occurred in programs with 
increasing numbers of target languages. More com- 
binations of observation, interview, and question- 
naire techniques were observed to be employed. 
This may be partly due to a comparable lack of 
instruments in a number of target languages. 



The Model 

We nbw turn attention to theoretical models which re- 
flect bbservations regarding our sample of respondents. 
I wqiuld like to emphasize that any observations are 
solely confined to our immediate sample. Three models 
are'offered: (1) The first is centered around a homogene- 
ous director-made decisions model for entry/exit level 
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criteria in language a'ssessment (2) The second reflects 
varieties of heterogeneous combinations of decision 
making. (3) The third typifies homogeneous sectors, 
other than director-made decisions. 

Seven characteristics are offered in descriptions of 
these models which are limited to the sample. These in- 
clude characteristics of: 7. Quantification/Observation, 
2. Context of Approaches, 3. Flexibility of Instruments. 
4. Criteria Influence. 5. Assessment Sectors. 6, Numbers 
of Languages, and 7, Duration of Program, 

Characteristics 

1 . Quantificdtion/Observation (a): The choice of work- 
ing for this particular characteristic emanates from 
Shuy's (1978) observations in designing the iceberg 
effect. The selection of discrete-point instruments 
(e.g.i TOEFL) by some of the respondents in isolation 
or combination suggests more quantifiable and" 
testable surface aspects but less so for what Shuy 
termed critical features (Semantic meaning and 
functional meaning). Integrative testing approaches 
the critical features more so by not isolating fre- 
quencies as much from natural context or reducing 
items into such small components for measurement 
as discrete-point tests. Although they may give a 
more global picture of language ability (e.g.. 
"CLOZE procedures"), integrative tests are still diffi- 
cult to measure in natural context (i.e.. a context in 
which the language is not artificially elicited). 
Moreover, clarification is wanting as to what in- 
tegrative tests actually meansure (also see Shuy. 

. 1978). A third selection one of direct rating meth- 
" ods has been regarded as having a high degree of 

content validity, assessing a full range of pertinent 
skills (also see Clark. 1978). Problems with direct 
rating procedures, such as those developed by the 
Foreign Service Institute, occur with questions of in- 
terreliability among raters. The fourth selection of 
self-report ratings raises questions as to reliability 
of responses from reporters. Theoretically, the sub- 
ject acts from a base of honesty and accurately re- 
sponds to the items. The fifth selection of direct ob- 
servation appeared to correspond the most, in 
terms of Shuy. to critical features and was least 
quantifiable for surface structures (including 
videotaping, classroom observation, etc.) 

\. 

" . The first model reflected quantification/observation 
Choices homogeneously with discrete-point or in- 
tegrative testing. The second heterogeneous model 
represents more random combinations. The third, 
reflected homogeneous choices favoring the so- 
called critical areas. 

2. Context of Approaches (b) Respondents were char- 
acterized by assessment processes either geared to 
language remediation (English/target language 
skills and communicative competencies), and/or 
bilingual academic subject area approaches. As 
noted earlier, a substantial association existed be- 
tvy/een respondents' provision for remediation op- 
portunities and available academic experiences for 

© target language utilization. Contextual testing, in 
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the manner expressed by Bondaruk. Child and Ter-^ 
rault (Spolsky and Jones. 1975) refers to attempted 
measurement of an individual's ability to apply 
knowledge of grammar-lexis systems to specific 
points within discourse. Context of approach in the 
bilingual assessment process suggests linguistic 
applications, in contrast, to particular subject area 
points corresponding to selection of observation 
techniques by sampled respondents. A significant 
correlation emerged between director-decision 
making in entry/exit-level criteria and selection of 
discrete-point instruments for other target lan- 
guages (n-56. rg-.3307. sig.-.013. p<.05). 

In comparison, the highest incidence of observation 
techniques, included in combination with other ap- 
proaches, accrued more readily with program facul- 
ty decision making. 

Here again. Model I was characterized by a homoge- 
nous preference for remediation-type approaches. 
Model II reflected combinations by respondents of 
approaches, while Model III favored homogeneous 
remediation type, on academic subject context ob- 
servation techniques. 

3. Flexibility of Instruments (c) This characteristic dif- 
fers from the one previously cited in that it indicates 
to what degree of combination, respondents pro- 
vide for instruments appropriate to described ap- 
proaches. Spolsky and Jones (1975). Oiler and Per- 
kins (1980) among others, endorsed the premise of 
using combinations in assessment approaches. The 
heterogeneous model reflected the tendency 
toward the greatest flexibility of approaches with 
over 70% of the sample respondents. The remu.i^ing 
percentages characterized one type of instrument, 
when utilized, at both ends of the spectrum for 
Model I and for Model III. 

4. Criteria Influence (d) An important consideration for 
our model is the degree of influence from sectors 
outside the program upon the determination of lan- 
guage assessment criteria. Internal administrative 
pressure may be a consideration in institutions for 
such programs as the one for Chamorro. in Guam. 
At the same time, a varying degree of influence was 
experienced with faculty other than the bilingual 
program in determining English and other target 
languages. An interesting pattern, cited above, was 
observed with the selection of a particular instru- 
ment (e.g.. FSI. MLAT. etc.) by programs in states 
with particular mandates. The factor of Title VII re- 
quirements was another consideration this study 
was unable to ascertain regarding degrees of 
influence. 

The limitations of interpretations for the particular 
sample are once again underscored, given the 
changing nature of the Title VII regulations. The 
data failed to place criteria influence as tangential 
to one model to the exclusion of others. They ap- 
peared to some degree as characteristics for all 
rhree models. 
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Assessment Sectors (e) Our fifth characfteristic 
directly relates to previously crted concerns as to 
sources of decision making and assesalment of 
languages In brief summation, director-nr^de deci- 
sions characterized the first homogeneous model, 
while combinations of sources typified me hetero^ 
geneous second model. The third modef was char- 
acterized by homogeneous sectors, othjer than the 
director. 

Number of Languages (t) Another prevailing feature 
for our models was the number of lariguages. Most 
programs cited one language in addit/on to English. 
The general tendency in this category was toward 
Spanish, although an occasional Varget language 
such as Yupik would appear m combination with' 
Engl'sh. As with the characteristic of criteria in- 
fluence, this characteristic was found tangential to 
one model, to the exclusion or others. However, the 
director-made decision homogenous model tended 
to subscnbe to the Spanish/English category. 

Duration of Program (g) The last characteristic 
refers to the number of yeats of existence for 
sampled programs. Aristotle tells us that "even men 
of experience succeed more than those who have 
theory without expehence" (cited in Broudy, 1962). 
It would seem to follow that longevity, in the sense 
of consistency of program existence, would yield 
some quality of experience. A general tendency 
emerged in which programs with increasing years 
of expenence beyond the mean exhibited a parallel 
preference for combinations of instruments and 
niore flexible choices for assessment approaches. 
The highest frequency of these responses occurred 
with the heterogeneous Model II. which was char- 
acterized by a variety of assessment approaches. 
The data also inferred that the majority of the two 
homogeneous models were in existence for a few 
years, in companson to the heterogeneous one. 



Diagram 1 presents us with a summarized overview 
of characteristics pertaining to our three theoretical 
models. Clearly these are not to be considered abso- 
lute within their theoretical framework, in view of 
aforementioned date which substantiates ranges of 
variation between stipulated models. These varia- 
tions are therefore implicit within the given 
structure. 

Conclusions and Recommendations 

1. Any assumptions drawn from this data must be ulti- 
mately limited to the samples of respondents. This 
is largely due to: 

a. a lack of substantial data base which is also 
consistent. Unfortunately, the literature has 
yielded little or no research to make definite 
comparisons. An obviou? need exists for further 
research in areas covered within this paper and 
related aspects. 

b. limitations of investigation approaches for this 
study. The nature of survey research whether 
questionnaire and/or interview techniques are 
still subject to question, especially in view of 
the lack of substantiating verification from 
other studies. However, these approaches 
remain the most effective existing means for 
gathering this type of data. 

c. a potential shifting of population itiean (u). A 
replication of this study will be most welcome 
to provide much needed additional data. The 
shifting tides of federal funding and university 
responsiveness to a need for training bilingual 
educational personnel may. for example, contri- 
bute to altering the nature of existing programs. 



Model 'Charactenstic 



Sunfinfiarized Overview of Characteristics 
for Theoretical Models 



Preference 
for discrete- 
point and in- 
tegrative 
tests 



Preference 
for remedial 
context 



Preference 
for one 
instument 



Relative num- 
ber/degree 
of influence 
sectors 



Homogene- 
ous director- 
made 
decisions 



Variable degree 
of influence 



Usually less 
than mean 
number of 
years 



Variety of 

combinations 
of instru 
ments 



Preference 
for variety of 
context 



Preference 
for combina- 
tion 



Relative num- 
ber/degree 
of influence 
sectors 



Heterogene- 
ous combina- 
tions of deci- 
sion makers 



Variable degree 
of influence 



Usually ex- 
ceeded mean 
number of 
years 



Preference 
for obser- 
vation-type 
instruments 



Preference 
for academic 
subiect 
context 



Preference 
for one 
instrument* 



Relative num- 
ber/degree 
of influence 
sectors 



Homogene- 
ous sectors 
of decision 
makers other 
than directors 



Variable degree 
of influence 



Usually less 
than mean 
number of 
years or more 



r D^/^ 'when utilized -1 0 • ) 
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2. The observed discrepancy between pcogram faculty 
and directors in experience and academic back- 
ground suggests some very strong concerns regard- 
ing the caliber of individuals establishing entry/exit- 
level criteria. Future studies might perhaps explore 
these factors in ^relation to other training concerns. 
This would appear critical since a sizeable number 
of the sample expressed between a low to average 
satisfaction with their own assessment processes 
(54.7%). Similar indices might be revealed in future 
studies of teacher training practices. 

3. Whether the result of a relatively short span of exis 
tence. deliberate institutional policy, or a combina- 
tion thereof, respondent programs revealed a seri- 
ous deficit in trained personnel. Mandatory insti- 
tutes, such as the one for leadership training, 
should be augmented on a regional level with 
performance-based sessions on comprehensive lan- 
guage assessment approaches. 

4. The federal grant to institutions should be treated 
in its strictest sense as a contractual obligation. As 
such, long-range commitments by the college or 
university, beyond the federal funding scope, would 
be a desirable inclusion, with the consequence of 
total repayment by the institution of expenditures 
upon breach of contract. This business-as-usual ap- 
proach is not unrealistic and would assist funding 
sources in their efforts to determine those institu- 
tions with sincere commitments from the short- 
range expediency of others. 

5. The assessment processes, which are outlined in 
the contract, should include (along the same lines 
of reassuring in point 4) the mandatory participation 
of school personnel and/or community resources. A 
distinction between "real" and "token" participation 
can be established through a separate evaluation of 
these sectors which would serve as a check and 
balance. 

6. More effective federal monitoring of training pro- 
grams and their environments seems desirable 
from several vantage points. Investment in short- 
term increases in visits and cross checks from inde- 
pendent sources may in the long run prove cost- 
effective in justifying expenditures for programs. At 
the same time, these visits may serve as an effective 
deterrent to potential abuses. 

7. Attention has been given to describing target com- 
petencies for teachers of limited-English proficient 
populations in public schools. A need exists to 
delineate similar competencies for their teacher 
trainers in colleges and universities. As the locus of 
funding power in the nation, the Office of Education 
might initiate this process in collaboration with na- 
tional and/or regional accrediting bodies. Incorpora- 
tion of flexible criteria into federal guidelines could 
ultimately contribute to providing target personnel 
who would better serve limited-English proficient 
populations, rather than catering to administrative 
structures and priorities of institutions. 

O 
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B. Training programs with increasing numbers of 
target languages may pose problems in the acquisi- 
tion of personnel and materials, the structure and 
schedule of academic coursework and remedial 
classes, as well as the establishment of language 
entry/exit-level criteria. The commitment of federal, 
institutional and perhaps in a number of instances, 
state resources is essential toward providing the 
personnel and environment. 

The need to focus our attentions upon the col- 
lege/university sector is increasingly imperative, 
given the myriad of complex political, social and 
economic pressures on the horizon, which continue 
to threaten the existence of programs for limited- 
English proficient students currently attending 
public services. 

With this in mind, it is hoped that this study is a 
beginning in a series of others which will effica- 
ciously increase our current knowledge regarding 
teacher training processes in colleges and universi- 
ties, promote educationally sound decisions and im- 
prove upon current practices. 
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ISSUES OF SPECIAL EDUCATION AND LANGUAGE ASSESSMENT 
FOR THE LIMITED^ENGLISH PROFICIENT STUDENT 
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When I was asked to write about special education or 
the special needs of the linguistic minority. I was quite 
reluctant to lay claim to an area that was out of my ex- 
pertise (special education). I did not respond to the invi- 
tation until I was sure that my expertise, curriculum 
theory-bilingual education, would be a valuable contri- 
bution to the field of bilingual-special education. I finally 
accepted this challenge when I realized that for me to 
talk about special education from a technical point of 
view would be foolish, if not outnght dishonest, but still. 
I could analyze the concept of "special needs" if this 
domain were analyzed from a sociopolitical paradigm. 

The purpose of this paper is. then, to analyze ihe con- 
cept of special needs ds described from a sociopolitical 
level. This paper will not deal with specific treatments 
in cases of mental retardation, hyperactivity or other 
conditions that may have some medical history. I will 
leave these issues to the medical field where proper and 
competent professionals are familiar with the issues. 

The Concept of Category in a Special Needs Situation 

The concept of category or the process of categorization 
IS a logical place to begin our analysis. Some of the lit- 
erature assumes that there is always a direct relation- 
ship between special education and bilingual education. 
For example. Chapter 71 A; the Transitional Bilingual 
Education Act of Massachusetts, the Courts' Declaration 
of Policy was the following: 

Section 1 . Declaration of Policy 

"The General Court finds that there are large num- 
bers of children in the Commonwealth who come 
from environments where the primary language is 
other than English. Expenence has shown that 
public school classes in which instruction is given 
only in English are often inadequate for the educa- 
tion of children whose native tongue is another lan- 
guage. The General Court believes that a compensa- 
tory program of transitional bilingual education can 
meet the needs of these children and facilitate their 
integration into the regular public school curricu- 
lum. Therefore, pursuant to the policy of the Com- 
monwealth to insure equal educational opportunity 
to every child, and in recognition of the needs of 
children of limited English-speaking ability, it is the 
purpose of this act to provide for the establishment 
of transitional bilingual education programs in the 
\ public schools and to provide supplemental fman- 
ciai assistance to help local school districts to meet 
the extra costs of such programs" 



The concept of categorization through this act is in that 
transitional bilingual education becomes compensatory 
in nature, in which the bilingual population must 
eventually become part of the "regular" public school 
curriculum. Through this act public schools were now 
mandated to develop transitional bilingual education 
under specific definitions. They were: 

1. Classrooms in this domain were compensatory 
in nature. 

2. They were not part of the regular program. 

3. The transition toward English was the objective 
of the program. Native language instruction 
was eventually eliminated. 

This political-educational pardigm becomes now a "spe- 
cial" mission or a "special education" process for a 
specific population, the linguistic minority. It does not 
accept these children as regular children who just 
happen to speak another language. The political/pe- 
dagogical consequences of these kinds of definitions 
may take different avenues. In this case, bilingualism is 
not an asset, but a deficit, for a special population. 

Federal legislation also views bilingualism from the 
point of View that the limited-English proficient child 
has a deficit (not speaking English) and is thus catego- 
rized from a negative point of view. The Bilingual Educa- 
tion Act of 1968 (popularly known as Title VII) is a clas- 
sic example. Although it is believed that federal legisla- 
tion was passed to preserve language and culture out- 
side the English language, the federal legislation was 
passed under different assumptions. First, the long- 
range goal was not bilingualism. but proficiency in En- 
glish (National Institute of Education 1975:6) Other 
stipulations such as poverty criteria or percentages of 
proficient English-speaking children in a classroom, 
classify a specific population outside mainstream socie- 
ty. Again, it is important to note how society at large 
and the educational enterprise view a special population 
with "special problems." 

The concept of categorization becomes more complex 
when one questions the idea of "special." "abnormal." 
limited-English proficient" or other concepts that are ap- 
plied to educational theory and practice. If one assumes 
that within the human range of behavior and conditions 
there is a diversity of differences, then all behavior or 
conditions should be treated as "normal" education. 
That is. there is nothing special or different otitside the 
so-called deferred norm, but all behaviors, including 
speaking a different language, are conditions that have 
pedagogical or educational solutions. Once a behavior 
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is identified as a concept that belongs to a "special" 
category, then hegenomy may take place and the beha- 
vior is placed outside the norm. Bergin cites the process 
of categorization when she analyzes the Riverside 
Study. She states that: 

1. Public schools were the major labelers. 

2. Public schools shared their labels widely with 
other agencies. 

3. Black and Spanish-surnamed children were 
more likely to score seventy-nine or below on 
an I.Q. test than Anglo children. 

4. Among those scoring below seventy-nine on 
an I.Q. test, childrea^^who were Spanish- 
surnamed and who were from low socioe- 
conomic levels were more likely to be placed in 
special classes. 

5. Only 19 percent of the children placed in 
classes for the mentally retarded ever returned 
to the mainstream school program. 

6. Black and Spanish-surnamed children were 
"over-labeled" as mentally retarded and Anglos 
were "under-labeled." 

These conclusions are frightening if one follows 
through the logic that it is essential to label or catego- 
rize for educational purposes. One could easily conclude 
that children who are Black or who heive a Spanish sur- 
name tend to have low I.Q.'s and very rarely return to 
"mainstream" education. On the other hand. Anglos 
tend to have less special education problems than 
Blacks and Hispanics. The literature of incorrect labeling 
and the School's legitimizing of "special" problems is 
substantial. Beeghley and Butler talk about the conse- 
quences of intelligence testing in the public schools 
before and after desegregation. They state that the 
schools functioned to facilitate differential labeling of 
mental retardates. The labeling process has resulted in 
a form of institutional racism in special education pro- 
grams regardless of integration in the regular classroom 
(p. 746). In other words, the desegregation process in 
itself did not alleviate labeling children and the schools 
continued to categorize racial or linguistic minorities as 
"mentally retarded" or place them in "special" education 
clasi3es. Beeghley and Butler cite the following statistics 
in their research, (p. 746) 



Table I 



PROGRAM PARTtClPATION AND 
ETHNICITY 1968 



Type of Program 


Anglo 


Ethnicity 
Stack 


Chicano 


Educable Mentally 
Retarded 




82% 


87% 


Trainable tylHritaliy 
Rfttardf?d 


9% 


5% 


2% 


EduratJnnally 
Handicapped 


m 


7% 


5% 


Physically 
HarK<K apped 


18% 


7% 


7% 




100"/o 
(347) 


100% 
(126) 


101%* 

(201) 



X Mqual!, /8.018P 001 




If one analyzes these statistics carefully. Black and Chi- 
cano children have a high percentage of "educables," 
while Anglo children are less categorized as "educa- 
bles." When any school system or society accepts that a 
specific population has more "mental retardation" than 
another race or group, one must take a second look. 

Kamin studies this issue from a. sociopolitical perspec- 
tive. His research illustrates that I.Q. tests and their use 
as classification instruments are not neutral. His analysis 
suggests that biases are not only based in thP ir.stru 
ment but also in our "deep culture." present and paot 
which helps to support misclassification (p.317). 

The literature of the misuse of instruments and the mis- 
classification of students is abundant to conclude that 
the concept of categorization, especially in the field of 
bilingual-special education, must be used with extreme 
caution. This concept should be further examined when 
one approaches the idea of statistical norming in a 
special-bilingual education population. 

We will now look at this issue of what might be catego- 
rized as "normal" and what is categorized as "special." 



The Normal — Special Education Paradigm: 
A Look from a Minority Perspective 

I believe it is correct to suggest that not everyone has 
the same abilities to perform general and specific tasks. 
There is a range of human behavior and human ability in 
humanity, regardless of the language use of the indi- 
viduals. It may be possible that a statistical norm may 
belong to each language c roup because each group 
may have its own "norming f^ffect" toward a sociopoliti- 
cal norm That is. what is "st* ndard" or what is a "norm" 
is a political categorization e/en if one can legitimize a 
norm through statistical mecns. Haugen talks about 
norm from another perspective. When he talks about 
language use. he states: 

The concept of 'norm' in reference to language 
is highly ambiguous and slippery. It may refer 
to a standardized language like French, codified 
in grammars and sanctified by an Academy, 
taught in schools, and written by authors, but 
spoken by no one. except under duress. Any de- 
viation from such a norm is deemed to reveal 
one's lack of a proper education and is regarded 
as barbarism if it is unintentional. But it may be 
acceptable if it is an intentional stylistic varia- 
tion, either as a mockery of the lower classes or 
as a relaxation of standards, a kind of 'old shoe.' 
(p. 91) 

Here, the concept of norm becomes problematical in 
that social class language becomes a valued norm, but 
it still can be used as a statistical norm to legitimate a 
curve to differentiate between "normal" or "standard" 
language and "nonstandard" language. In other words, if 
one uses a norm to divide dialects from "standard lan- 
guage" from the point of view of categorization, many of 
our children may be incorrectly placed in "special" situa- 
tions. Although this logic seems obvious, there are still 
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many complex issues that revolve around language use 
in a monolingual and bilingual environment. Placing 
children in a statistical norm does not address the 
linguistic issues in a so-called bilingual-special educa- 
tion situation. 

It is true, however, that educators must use some defini- 
tions to prescribe educational environments for stu/ 
dents. As stated previously, within any linguistic popu- 
lation, there is a range of abilities — what might be 
categorized from "mental retardation" to the "excepjtion- 
al" child. We could accept behaviors and languages as 
"states of being" during one's lifetime and this obncept 
could lessen the impact of the category of ^/special." 
That is. within any specific language population, the so- 
called bell-shaped curve describes differej/t "states of 
being" during one's lifetime, and this ^ncept could 
lessen the impact of the category of "special." Within 
any specific language population, th^ so-called bell- 
shaped curve describes different "states of being" that 
identify specific needs and prescrib^s^ specific education- 
al environments. One may now argue that the problem 
of special education and the problem of states of being 
is only in semantics and the poncept of categorization 
has not been eliminated. Tho^key to solve this problem 
may be "norming" spec>fic language groups and 
prescribing educational e/ivironments within each lan- 
guage group. To make X\^s problem logical, let us exam- 
ine Figure I. / 

Figure I 

/ 

A STATISTICAL NORM FOR ABILITIES 
/in THE UNITED STATES 




In Figure I. the curve may be used to identify placement 
of individuals with reference to general and specific abil- 
ities. Children are usually classiied in percentiles with 
reference to categories, i.e.. reading comprehension, 
math ability, social studies and so on. If a child is below 
the 50th percentile, he or she is considered "below the 
norm." and. of course, if the child is above the 50th per- 
centile, he/she is considered above the norm. It may be 
correct to suggest that this curve is more or less accu- 
rate when one is speaking only of a particular linguistic 
community, but Table I implies that more Blacks and 
Chicanes are below the "norm" by the percentages of 
educable mentally retarded students in each population. 
If one recalls, there were significantly more Black and 
Chicano children placed in "special" education classes 
than Anglos. The data seems to suggest that we must 
rethink the statistical norm to fit the linguistic language 
minorities in the United States. Let us assume that each 
linguistic population has its own bell-shaped curve 
within Its own cultural or political context. If one ac- 
cepts this paradigm ar if one accepts the current re- 
search data that racia jnd linguistic minohties have a 
higher placement in 'special education" classes. Figure 
II may illustrate a visual description of our problem. 



Figure II 

NORMING FOR DIFFERENT LINGUISTIC 
POPULATIONS 




Language Minority Norm 
English or Anglo Norm 



One may note that the "English-Anglo" curve or no^m 
and the "Language Minority Norm" come from twc dif- 
ferent "standards." and both curves have their own 
statistical "mean." Hence, if one acquires the skills and 
expertise to identify "special" situations in each lan- 
guage group, the placement of minority children v^ill be 
less political. 

It is important to note that the author of this paper is not 
suggesting that each population should be segregated 
totally within its own "special education" situation. 
(Which, by the way. becomes a monolingual situation 
instead of a bilingual environment) The author is imply- 
ing that the process of categorization as it now takes 
place in special education within the context of lan- 
guage is now labeling and misplacing language minority 
children in "special" situations in which they do not 
belong. Simply said and accepting all the complexities, 
some of the children do not have the "knowledge 
goods" 01 "cultural capital" (English) to compete with 
another population. The present process seems tu seg- 
regate more children into "special" situations because 
of the lack of instruments for the language minority 
populations. This is a process that is unacceptable in 
our educational institutions. 

Some Comments and Conclusions on the Process of 
Categorization in a Special-Bilingual Education Situation 

We must confront the issue that most paradigms that 
categorize human beings are political in nature, except 
where severe physical handicaps are involved. The 
"liberalism" of the sixties and of the seventies which pro- 
duced new systems or categories to legitimately acquire 
a process of equal opportunity for all groups may have 
failed because it failed to take into account the "deep 
culture" of its own value system. It seems to be correc* 
when some social scientists and educators stated that 
in the final analysis, the new sociology fails in spite of 
its desire for radical and fundamental change with refer- 
ence to the concept of egalitarianism. Its failure is in the 
inability to illuminate how social and political structures 
function to mask reality and promote ideological 
hegemony (Gramsci. 1971: Entwistle. 1978; Giroux. 
Penna. 1 979). To justify this situation is to continue seg- 
regation, deny equal access to the minority and eventu- 
ally maintain two societies, separate and unequal I 
hope this is not the case. 
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Dr. Simoes (1981) noted and gave evidence of the socio- 
political perception of the bilingual child. In this para- 
digm, he highlighted the negative view of the bilingual, 
and in particular, the perception that the bilingual is 
deficient in his abilities as well as in need of compensa- 
tory education. Perhaps at this point, further elucidation 
of the integration of psychology, bilinguaism. and the 
historical roots of this merger would be appropriate. 
This paper will focus on the effects of bilingualism on in- 
telligence. an area heavily investigated in psychology 
and eluded to in Simoes' presentation (1981). 

Numerous studies determining the effects of bilingual- 
Ism on intelligence present contradictory results, rang- 
ing along the gamut from detrimental effects to the en- 
hancement of intelligence. The bulk of the earlier studies 
indicated bilingualism to have negative effects on in- 
telligence. However, the ability to generalize from the re- 
sults of these studies is seriously curtailed by the lack of 
adequate experimental controls and methodological 
practices that bring the validity of their conclusions into 
question. 

An early study in this area was conducted by Saer 
(1923) employing 1.400 bilingual urban and rural Welch 
children. The results of a Stanford Binet intelligence test 
(translation was provided), vocabulary, composition 
and dextrality tests indicated the intellectual superiority 
of monolingual children and was interpreted in terms of 
"mental confusion" experienced by bilinguals. Similarly. 
Smith (1923). Yoshioka (1929). Wang (1926). Mead 
(1 927) and Rigg (1 928) conducted studies with a variety 
of bilingual groups and mdlC3ted the inferior intellectual 
ability of bilinguals. However, the lack of experimental 
controls for vital factors as age. sex. and socioeconomic 
status and subjective scoring procedures (Smith. 1923) 
and measures of bilingualism (Wang. 1926; Mead. 
1927; Rigg. 1928) limit the generalizability of these re- 
sults. Additionally, many of the tests administered were 
verbal in nature which could artificially depress the IQ 
scores of bilinguals who may have a language disadvan- 
tage because of limited proficiency in the test language. 

Several authors addressed this latter issue by admin- 
istering nonverbal intelligence tests. A minority of these 
studies (Jones and Stewart. 1951; Anastasi and Cordo- 
va. 1953. Anastasi and De Jesus. 1953) indicated the in- 
tellectual superiority of monolinguals over biJinguals. It 
' should be noted that some of the aforementioned meth- 
odological concerns are In question In these studies as 
well as the lack of testing sophistication in the bilingual 
group. Interestingly, in a later study (Jones. 1959) when 
the two linguistic groups were equated for socioe- 
conomic status. It was found that there no longer was a 
significant difference in IQ scores obtained on. the non- 
verbal test, dramatically accenting the impdrtance of 
controlling this frequently overlooked criteria, The 



majority of studies (Pinter and Keller. 1922; Barke. 
1933; Arsenian. 1937; Seidl. 1937; Darcy. 1946; 
Johnf.on. 1953; Carrow. 1957) indicated no statistically 
significant difference between monolingual and bilingu- 
al IQ scores as measured by nonverbal tests. Adding to 
the confusion, a number of studies (Hill. 1936; Spoerl. 
1944; Kolaska. 1954) employing verbal intelligence 
tests found no significant difference between mono- 
lingual and bilingual IQ scores, and a few studies (Davis 
and Hughes. 1927; Stark. 1940; Peal and Lambert. 
1962) using verbal as well as nonverbal IQ tests noted a 
positive correlation. 

To better understand the interaction between intelli- 
gence and bilingualism. one must critically evaluate the 
past research. First, the lack of uniformity in defining bil- 
ingualism limits the comparison between studies and 
the ability to generalize. Only two studies employed ob- 
jective criteria and measures of bilingualism (Johnson. 
1953; Peal and Lambert. 1962). Other studies provided 
either subjective measures such as questionnaires or no 
control over this variable. For example. Pinter (1932) 
used the child's surname to determine bilingualism. 
Pinter and Keller (1922) determined bilingualism based 
on the parents' nationality, and Davles and Hughes 
(1927) assumed bilingualism based on the Jewish eth- 
nicity of the subjects. 

In determining the bilingualism of subjects, several 
criteria must be considered. First. O'Doherty (1958) 
notes the distinction between the balanced and pseudo- 
bilingual. The pseudo-bilingual is one who is far more 
oroficient in one language.than another and does not 
use his second language as a means of communication. 
A balanced bilingual is proficient in both languages and 
has the ability to use either one as a means of communi- 
cation. Much of the research indicating the detrimental 
effects of bilingualism on intellectual functioning em- 
ployed pseudo-bilinguals. a highly questionable practice 
in view of O'Doherty's (1 957. p. 285) claim. "The pseudo- 
bflmguai4s the real problem, since very often he fails to 
master either language, while the bilingual by definition 
has mastered both." The importance is further supported 
by Peal and Lambert (1962) who employed balanced 
bilinguals and in fact showed bilingualism to have a 
positive effect on cognitive functioning. 

A second significant differentiation, which is dependent 
upon the acquisition method of the two languages, is 
between compound and coordinate bilinguals. Osgood 
and Ervin (1 965) describe compound bilingualism as ac- 
quiring both languages' in the same context of learning 
one through the medium of the other. Thus, the two lan- 
guages comprise a single system. The coordinate 
bilingual acquires the two languages in two different 
contexts. These two languages are acquired at different 
times, places, or concurrent life situations, and thus. 
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parallel language structures evolve. These differing 
linguistic structures in bilingual individuals may affect 
the nature of their bilingualism. 

A third consideration is the age of onset of the second 
language. This issue is much debated in the research lit- 
erature with some researchers claiming linguistic supe- 
riority of early onset bilingualism (Judd. 1927; 
Castillejo. 1933; Arsenian. 1937; Penfield. 1953) while 
others vehemently argue. "It is suggested that bilingual- 
ism in young children is a hardship and devoid of appar- 
ent advantage, because bilingualism appears to require 
a certain degree of mental maturation for its successful 
mastery." (Yoshioka. 1929. p. 476). Although the optimal 
age of second language learning is disputed, it is agreed 
that "the time at which the second language is intro- 
duced to the young child may be one of the critical 
dimensions of the ultimate cognitive effects of being 
bilingual." (John and Horner. 1971. p. 171) 

Sociocultural and psychological factors can affect bil- 
inguailism and thus should be accounted for in the ex- 
perimental paradigm. Weinrich (1970) notes several 
extra-linguisttc factors that affect the nature of bil- 
ingualism such as size and sociocultural homogeneity 
of the bilingual group, as well as idiosyncratic and ste- 
reotyped attitudes towards the languages, associated 
culture, and bilingualism itself. Several studies (Chris- 
tophersen. 1948; Anisfeld. Bogo. and Lambert. 1961; 
Levine, 1969) point to affective associations with a lan- 
guage which affect bilingualism. Fishman (1968) under- 
scores the necessity of looking at a comprehensive soci- 
olinguistic model in comprehending the functioning of 
the bilingual. 

Thus, the scrutiny of many complex variables is a 
mandatory step which must precede the evolution of a 
uniform definition of bilingualism. This definition is 
mandatory if research is to generate broad psycho- 
linguistic pnnciples regarding the effects of bilingualism 



on intelligence, yet. to date, this criteria remains 
unfulfilled. 

Additionally, one can be critical of past research due to 
the types of intelligence tests employed. As previously 
noted, many early studies (Smith. 1923; Portenier 
1947) used only verbal IQ tests which may artificially 
depress bilingual IQ scores because of language disad- 
vantage. Translations of verbal IQ tests were also em- 
ployed (Saer. 1922; Keston and Jiminez. 1954); howev- 
er, methodologically, this is. a highly questionable prac- 
tice since the translation had not been subjected to 
standardization procedures. In general. Darcy (1963) 
criticized the practice of using solely verbal IQ tests 
citing their lower validity as an intelligence measure. 

The lack of controls over variables such as age. sex, 
number and socioeconomic status was previously 
noted in much of the research. In addition, several stud- 
ies employed IQ tests involving time limits which have 
been shown to penalize the IQ scores of bilinguals 
(Knapp. 1960; Lewis, 1959). 

A final criticism stems from the lack of longitudinal data 
on the cognitive development of bilinguals. To obtain a 
comprehensive, holistic view of the bilinguals cognitive 
development and intellectual functioning, one must 
evaluate the child over a period of time, rather than at 
one point in time. 

In conclusion, the effects of bilingualism on Intelligence 
have been heavily researched, yet contradictions and 
ambigula have resulted . To some degree, this confu- 
sion may have unwittingly aided the negative percep- 
tion of the bilingual child documented by Simoes 
(1980). Perhaps more rigorous, methodologically soun- 
der research is mandatory to clarify the relationship be- 
tween IQ and bilingualism and dispell unwarranted per- 
ceptions of the bilingual child. 
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The data collection and language assessment policy In 
Illinois has been established through a dynamlcjjrQcess. 
This process involves a variety of individuals whose 
interests and expertise vary greatly. The resulting policy 
represents a conscioulsly directed effort toward a uni- 
form data collection procedure which is currently uti- 
lized by Illinois' Transitional Bilingual Education Pro- 
grams serving over 42.000 students. 

This paper will be divided into three sections. The first is 
a historical perspective of the evolution of bilingual pro- 
gram evaluation efforts. Included in this section are cur- 
rent activities and special projects that are provided to 
Transitional Bilingual Education Programs. 

The second section of the paper deals with Illinois' uni- 
form data collection procedures. These procedures pro- 
vide the framework for the longitudinal studies that 
began with the first annual Program Summary and 
Evaluatior) Report (19B0). Sections of this report, pre- 
pared and published by the Illinois State Board of Educa- 
tion, are included within this paper 

The third section of the paper outlines the information 
that was obtained as a result of the data collection and 
language assessment policy in Illinois. Primary evalua- 
tion questions were asked of the Illinois data. These 
questions dealt mainly with issues such as transition 
and exit rates, program participation Information and 
student identification information. Further analysis of 
achievement as a measure of program exit are also 
Included. This section closes with information on testing 
in^;truments that are used to assess language proficien- 
cy, dominance and achievement. 

1 . Historical Perspective : 

a. Bilingual Education in Illinois Prior to the 1976 
Mandate 

Prior to 1971 when the Bilingual Section of the 
Illinois State Board of Education (known then 
as OSPI) was established, local districts provid- 
ed programs for non-English speaking or 
limited-English proficient students at their own 
expense or through funds received from Title I. 
ESEA. earmarked for English as a Second 
Language. 

Bilingual Education in Illinois was implemented, 
for the most part, in districts which had high 
concentrations of limited-English proficient stu- 
dents. Local district personnel who were willing 
to write a proposal describing and documenting 
legitimate local needs received discretionary 
state funds. Long narratives were submitted ad- 
dressing the newly developed six goals of 
Bilingual Education: 



Goall: Students in the bilingual-bicultural 
program will achieve fluency and lit- 
eracy in two languages. 

Goal 2: Students in the bilingual-bicultural 
program will achieve at a rate com- 
mensurate with their own age. abili- 
ty, and grade level in all school sub- 
e ject areas. 

Goal 3: Students in the bilingual-bicultural 
program will demonstrate growth m 
self-esteem. 

Goal 4: Students in the bilingual-bicultural 
program will be provided with a 
coordinated and integrated learning 
environment through effective 
coordination with the regular school 
program. 

^Goal5: All teachers and staff members of 
participating schools will be involved 
in a cbmprehensive inservice train- 
ing program. 

Goal 6: Parents and other community mem- 
. bers will be involved in the planning, 
implementation, and evaluation of 
the bilingual-bicultural program. 

These goals were expected to be implemented 
in order to address the needs of bilingual stu- 
dents who were classified on the basis of lan- 
guage and/or performance. The following 
classifications which were tied to funding of 
bilingual programs from 1975-1979 were devel- 
oped. Districts were reimbursed by the bilingual 
section for students Levels l-IV who were re- 
ceiving program services. 

Level I: The student does not speak, under- 
stand, or write English, but may 
know a few isolated words or 
expressions. 

Level II : The student understands sinhple sen- 
tences in English, especially if 
spoken slowly, but does not speak 
English, except isolated words or 
expressions. 

Level III: The student speaks and understands 
English with hesitancy and difficulty. 
With effort and help, the student 
can carry on a conversation in En- 
glish, understand at least parts of 
lessons, and follow simple directions. 
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Level IV: The student speaks and understands 
English without apparent difficulty, 
but displays low achievement in- 
dicating some language or cultural 
interference with learning. 

Level V: The student speaks and understands 
both English and the home language 
^ without difficulty and displays 

\ normal academic achievement for 

grade level. 

LevehVl: The student (of non-English back- 
\ ground) either predominantly or ex- 
\clusively speaks English. 

Bilingual^vprograms in local districts were 
evaluated during the 1974-1977 school 
years utilizing a type of-^goal-centered eval- 
uation model knbyvn to districts as On-Site 
Evaluations. Districts were not selected at 
random for evaluations. They were, howev- 
er, selected to allow "representation of a 
cross-section of the programs in terms of 
language, school level, program model and 
strong and weak programs as judged by 
bilingual education program specialists 
from the state office." (Park, p. 3) 

The goals and strategies of the On-Site 
Evaluation included the following: 

• The first was to assist the local bilingu- 
al education program by providing im- 
mediate feedi)ack to the program staff, 
especially while the program was in 
the developmental stage. 

• The second goal of the On-Site Evalua- 
tion was to provide a vehicle for involv- 
ing and inservicing a cadre of people j 
interested in bilingual education to en- 
hance their information and expertise 
(Bilingual Education Service Center, 
1976). 

• The third goal of the On-Site Evaluation 
was to assist the State Board of educa- 
t iof^ to tdeottfytng overall strengths 
and weaknesses in bilingual programs 
across the state in order to influence 
program and funding priorities. 



b. Bilingual Education in 
Years (Pdst '76 Mandate) 



linois in Recent 
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In 1978, the bilingual section abandoned 
its efforts to conduct internal evaluations 
of its programs after the Transitional 
Bilingual Education mandate (Article 14-C 
of the School Code) was enacted. Educa- 
tional specialists were employed to monitor 
local programs and to assure cooperative 
efforts in achieving compliance with Article 
14-C and the Rules and Regulations for 
Transitional Bilingual Education (1976). 



The Illinois State Board of Education in 
June, 1979, received a final report on an 
evaluation of the state-funded Transitional 
Bilingual Education Program which was 
conducted by L. Miranda and Associates, a 
third-party evaluator. 

Although the findings of the evaluation 
were limited, the results, nevertheless, 
pointed toward some specific weaknesses 
in the program that were in need of 
remediation: 

—The need to establish a management in- 
formation system to maintain complete, 
accurate, uniform and comprehensive 
program data; 

—The need to establish a framework to 
guide planning, operation and evaluation 
of Transitional Bilingual Education 
Programs; 

—The need to establish a goal-based evalu- 
ation model supplemented by special 
studies to determine context and design 
variables that affect student 
performance; 

—The need to focus on answering evalua- 
tion questions that can provide guidance 
for program effectiveness and system 
design. 

As a result of the recommendations by the 
third party evaluators, the Illinois State 
Board of Education, in June, 1979, required 
that standardized procedures be developed 
for the determination of student eligibility 
and program participation in Transitional 
Bilingual Education Programs. 

In response to this recommendation, 
advice from various levels was elicited. 
Meetings were held with decisionmakers 
at the state and local levels. Program Evalu- 
ation and Assessment staff and individuals 
representing both downstate and Chicago 
-pfflQfa m s w er e in v olved - — 

In August, 1979, two outside contractors 
were, hired to provide technical assistarice 
to the State Board of Education by coor- 
dinating and organizing a state-wide inser- 
vice workshop on evaluation procedures 
for Illinois. Transitional Bilingual Education 
Program Coordinators and Directors. In ad- 
dition, the contractors were expected to ad- 
dress the areas of local district and state- 
wide data collection procedures in an at- 
tempt to refine the information procedures 
and to draft guidelines and appropriate 
data collection instruments. 
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On September 21. 1979. the state-wide 
workshop on bilingual education evaluation 
and reporting procedures for FY-80 was 
held. At this meeting the bilingual program 
directors were presented information on 
the recently developed uniform state-wide 
procedures for data collection and program 
administration. 

In December. 1979. the Illinois State Board 
of Education advertised for the newly de- 
signed position of Bilingual Program Eval- 
uator. The specific duties of the evaluator 
included: 

— Develop format procedures for evalua- 
tion and reporting; 

—Implement and coordinate the state-wide 
. evaluation system; 

— Deliver inservice and technical assis- 
tance to state and local educational 
agency personnel regarding bilingual 
student assessment and bilingual pro- 
gram evaluation; 

— Provide assistance to state education 
agency staff in drafting state plans, grant 
applications and responses to RFPs with 
respect to implementation of the evalua- 
tion component; 

—Prepare an annual Illinois State Board of Edu- 
cation report based on a synthesis of multiple 
sources of data which will be disseminated 
to the members of the General Assembly, to 
local districts, and to concerned individuals. 

On January 7, 1980, the Bilingual Program Eval- 
uator was hired and assigned to the Program 
Evaluation and Assessment Section. Since that 
time, a number of major evaluation activities 
have occurred. These activities can be summa- 
rized as follows: 

1) Development of evaluation reporting forms 
and procedures as part of a state-wide uni- 
form data collection procedure; 

2) Organization and presentation of five re- 
gional workshops on bilingual education; 

3) Provision of on-site technical assistance to 
state and local district personnel: 

4) Dissemination of information regarding 
bilingual evaluation to local, state and na- 
tional agencies: 

5) Collection of statewide data to be used in 
preparation of an annual report. 
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c. Current Activities and Special Projects 

Local districts have received inservice and 
technical assistance through the following ac- 
tivities and programs which provided an inter- 
dependent network of support services. 

i. Regional Evaluation Workshops on Bilin- 
gual Evaluation 

In March and April of 1979. five regional 
workshops were held. The workshops 
included an overview of the new state-wide 
data collection procedures with specific in- 
formation provided on form completion. 
Sessions were also presented'^on writing or 
revising District Assessment Procedures. 
English language evaluation and English 
language proficiency instruments. 

ii. Title IV National Origin Desegregation 
Project 

The Title IV National Origin Desegregation 
project, a grant award operating under the 
auspices of the bilingual section, has 
provided services to requesting districts 
throughout the state during the 79-80 
school year. These services have included: 

• On-site assistance in developing na- 
tional origin desegregation plans, 

• Inservice training for staff and com- 
munity as a result of the implementa-." 
tion of a national origin desegregation 
plan, 

• Inservice training for staff providing 
special education services including 
counseling and scheduling practices 
for national origin minority high school 
students. 

• Inservice training for parents of nation- 
al origin minority children who are re- 
ceiving special education services, 

. — ^Assistance in -planning appropriate 

special education curriculum for na- 
tional origin minority students. 

• Assistance in identifying appropriate 
placement language tests and assess- 
ment instruments for student 
placement. 

■ ( 

• /|^$jsistance in planning meaningful pro- 
grams of instruction and appropriate 
program models for national origin 
minority high school students. 
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Title VII SEA Technical Assistance Project 

The federally funded Title VII SEA Technical 
Assistance project, a grant award operating 
under the auspices of the bilingual section, 
has provided technical assistance to 
school districts in cooperation with the 
Bilingual Education Service Center and 
other outside agencies. Among the services 
provided are: 

• Visitation and monitoring of all Title V!l 
projects: 

• Sponsorship of a conference on bi- 
lingual/multicultural materials: 

• Sponsorship of state-wide meetings 
for bUingual program directors: 

• Sponsorship of / two regional parent 
conferences held at East Moline and 
Elgin and a stite-wide parent confer- 
ence held in^Ghicago: 

• Sponsorship of workshops on crosscul- 
tural communication, Asian curriculum 
adaptation, and proposal writing: 

• Organization of a colloquium series on 
special issues related to the education 
of limited-English proficient childreri; 

• Organization of meetings for admin- 
istrative staff on Rules ar^d Regulations 
and special education: 

• Coordination of the State and institu- 
tions of higher education efforts on 
bilingual teacher certification appeals. 

lA total of 120 bilingual program direc- 
! tors, coordinators, teachers and aides 
' attended the workshops. In addition, 
I persons attended who were responsi- 
1 ble for bilingual student assessment at 
J the local level, including principals, su- 
perintendents, regular classroom 
I teachers, school secretaries, and spe- , 
/ cial service personnel. Forty-two of the 
I state-funded districts sent staff to the 
; regional workshops. 

iv. /On-Site Technical Assistance Provided to 
/State and Local Educational Agency 

/ Personnel 

/ 

/ Information dissemination on the new 
/ state-wide data collection procedures has 
/ been very effective through direct contact 
/ and consultation with, atate and local 
/ agency personnel. Meetings have occurred 
between the bilingual evaluator and 
; SEA/LEA staff. Fifty-two of the 70 funded 
districts were visited. In addition, there 
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were numerous ryieetings and phone con- 
versations with Chicago District #299 pro- 
gram, technical- services and Research and 
Evaluation staff. 

Annual student report data from 70 local 
districts began Arriving at the State Board 
in June. Data clarification and editing were 
completed afterl contact was made with 
source districts.! Many site visitations and 
phone conversations were made in order to 
enable the necessary linkage to provide 
accurate data reporting. 

The cooperation between local education 
agency personnel and the bilingual evalua- 
tor has been good. The technical assistance 
sessions have resulted in product-oriented 
outcomes usually focusing on drafting of 
district-assessment procedures and/or de- 
velopment of student-assessment strate- 
gies that are local-district specific. 

The interactions between state education 
agency personnel and the bilingual pro- 
gram evaluatjor have been of two types: a) 
informational overview and b) inservice of 
bilingual educational specialists who serve 
in a consultative role to local districts. Pack- 
ets of infornrlation regarding the state-wide 
evaluation were distributed to bilingual 
education specialists and to the Illinois 
Resource Center consultants to aid them in 
their provision of technical assistance to Il- 
linois local districts. 

V: Indochina I Refugee Children Assistance 
Program 

The federally funded Indochina Refugee 
Children Assistance Program has operated 
under the auspices of the bilingual section. 
Supplemental educational services provid- 
ed for eligible Indochinese students by the 
State Board have included the following: 

• Adnhinistration of the Indochina Refu- 
gee! Children Assistance Program and 
thermitlal phases of the Trans ition Pro- 
gram for Refugee Children and Educa- 
tional Services for Cuban and Haitian 
Entrant Children Program; 

• Coordination and presentation of work- 
shops on topic of Indochinese lan- 
guages, cultures and materials for 
teachers, administrators, aides, etc.; 

• Participation in planning sessions for 
the forthcoming state-wide conference 
/"Meeting Education Needs of Refugee 
/Children in Illinois": 



• Dissemination of packets of Indochi- 
nese material and information to over 
60 different school districts and 
agencies; 

• Establishment of comprehensive Indo- 2. 
Chinese Resource . Library with over 

300 holdings including bilingual texts, 
teacher's manuals, cultural materials, 
maps, cassettes; dictionaries, area 
handbooks, bibliographies, etc. (Over 
100 agencies were contacted during 
the development of this collection.); 

• Act as liaison between the State Board 
and U.S. Department of Education on 
issues related to the Indochina Refugee 
Children Assistance Program; 

• Provision of Information and Referral 
Services, e.g., assisting school districts 
in identifying available resources to 
supplement local efforts to provide ser- 
vices to refugee students. 

Illinois Resource Center (IRC) 

The Illinois Resource Center (IRC) was es- 
tablished in 1972 and until 1975, operated 
as the Bilingual Education Service Center 
to "assist school personnel in the proper 
identification and educational assessment 
of students of limited-English proficiency 
and the establishment of appropriate in- 
structional programs of remediation, based 
on the needs of individual school districts. 
A professional staff of highly trained educa- 
tors provided direct assistance to teachers 
through workshops, consultations, and 
materials dissemination. IRC staff have de- 
veloped and housed the most comprehen- 
sive collection of instructional materials, 
resource documents, and research-based 
literature treating the education of limited- 
English proficient students in the nation. 

Throughout the year, several hundred 
teachers received assistance fom the IRC in 

a ..yatijely of education areas vyhich 

included: 

• A diagnostic/prescriptive interpreta- 
tion of individual assessment of the 
language minority student for effective 
instruction, 

/ • .A fostering of oral language and litera- 
•Cy development of the LEP student, 

• Classroom management techniques 
for the multilingual classroom, 

• Adaptation of content area instruction 
for the LEP student 



• Identification and assessment of LEP 

^ exceptional children and appropriate 

^ instructional techniques of 

, y remediation. 

Uniform Data Collection Procedures 

The'administrative procedures for bilingual educa- 
tion programs have undergone significant modifica- 
tions over the past years. The following components 
were developed in establishing a uniform state- 
wide procedure for program administration and 
data collection. 

The components are: 

a. PUBLIC SCHOOL BILINGUAL CENSUS 

b. PROGRAM APPLICATION 

c. STUDENT CUMULATIVE RECORD 

d. ANNUAL STUDENT REPORT 

Figure 1 shows a graphic representation of the inter- 
relationship between the program administration 
and data collection procedures. While each compo- 
nent is reported independently, the interdependen- 
cy among them is firmly established. Because the 
district assessment procedure is an integral part of 
each component, it is depicted as such. 

a. The Public School Bilingual Census 

The Public School Bilingual Census (see Figure 
2) has been designed to identify all non-English 
language background students and to distin- 
guish which of these students are eligible for 
program participation at the attendance center. 
Students are recorded, according to language, 
in one of three columns. 

Column A is provided for students with a non- 
English background who are attending classes 
at the attendance center. Column B is provided 
to distinguish those students reported in 
Column A whose English proficiency level is 
below average in aural comprehension, speak- 
ing, reading or writing in English. Column C is 
provided to distinguish those students reported 
in Column A whose English proficiency level in 
aural comprehension, speaking, reading and 
wrltingls equal to or above average. 

In order to conduct the census, each local edu- 
cation agency must have developed a district 
assessment procedure. The quality of the dis- 
trict assessment procedure reflects the degree 
to which districts are able to appropriately 
identify and evaluate students from non-English 
backgrounds. 

b. The Bilingual Program Application 

The Application for Transitional Bilingual Edu- 
cation Program (see Figure 3) is designed to 
provide program descriptions at the attendance 
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center level. It is the mechanism whereby dis- 
tricts may request state reimbursement for ser- 
vices provided to students identified in Column 
B of the Bilingual Census. It also provides local 
districts with the opportunity to request that 
some eligible students (as identified in Census 
Category B) be exempted from program partici- 
pation. This process is known as Waiver of Pro- 
gram Participation. It is checked against the re- 
sults of the needs assessmentrthBTwas-per- 
formed by the district juxtaposed with the in- 
structional program which would be provided 
the student in lieu of bilingual education. 

Information to be recorded includes a personnel 
summary and a program summary by language 
group for each attendance center in the district. 
Program budget information is listed according 
to funding source for each expenditure catego- 
ry. JJhe application also contains the district as- 
sessment procedure, which is a description of 
the overall decision-making process through 
which program entry and exit decisions are 
made. 



c. Student Cumulative Record . 

The Student Cumulative Record (SCR) (see 
Figure 4) is designed to provide local districts 
with a vehicle for collecting information about 
students at each attendance center. The Stu- 
dent Cumulative Record remains in the district 
and serves as the basis for required end- 
of-the-year reporting. This document has been 
d^esTghe^to ToHbw tlT(^^ 
year period. Recorded on this form are data on 
individual student progress in English, home 
language, and subject matter skills. Also record- 
ed are data on program entry and exit. 

The Student Cumulative Record is a form that 
districts may modify according to local needs. 
The Chicago Public School System, for exam- 
ple, collects and reports more information than 
that contained on the SCR and chooses to uti- 
lize a cumulative record idiosyncratic to their 
district. 



FIG. 1 
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d. Annual Student Report 

The Annual Student RepokiASR) (see figure 5) 
is a computer-generated turnaround document 
which is submitted to the St^te Board of Educa- 
tion. Local districts may conrk>ile student data 
from their student cumulative records. This 
report provides the basis for state-wide evalua- 
tion activities to assess IndiyiduaXan^^^ 
gram effectiveness. Individual stuoient informa- 
tion is reported on the ASR by studdnt identifi- 
cation number. It includes scores \)n English 
language proficiency instruments, attendance, 
and other programmatic information. 

Some large districts, whose computer fa\;ilitltes 
are easily accessible, have chosen to subnrjt 
annual student report data on magnetic tape or 
disc for their large numbers of students.XAX 
\ reporting format is provided those districts 
\ along with data edit criteria. Special provisions 
\are made between the Data Control Section of 



the State Board and districts submitting via 
tape or disc. 

In the spring of 1981. all districts received pre- 
printed Annual Student Report Forms based on 
student information reported during the previ- 
ous ye^r To expedite current-year reporting, 
identification numbers, language code, entry 
data years ia other programs, and grade 4evet 
incremented by one year were preprinted. Pro- 
gram staff needed only to add current, updated 
information, rather than to duplicate the previ- 
ous efforts. 

Concurrent with receipt of the preprinted 
forms, districts received an aggregate summary 
of their district's bilingual program information. 
This turnaround data is expected to be useful in 
allowing districts to present information to 
local decision makers requesting an accurate 
statistical summary. 
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3. Major Findings 

This capsule summary highlights the major findings 
from the data collected on students enrolled in 
downstate and Chicago transitiorwil bilingual educa- 
tion programs in Illinois. The majority of Illinois' 
143.471 students froi^-QOn-English backgrounds 
come from Spanish-spealclna backgrounds. Bilin- 

gual Census fiflurea indicate, tmt 93J14x„QrJ.6% Qf . 

students, are of Spanish backgrounds. The next 
- largest language background groups represented 
by Illinois' students are Korean (4.740. or 3.3%); 
Arabic (2.405. or 1.7%): Vietnamese (1,637. or 
1.1%); and Assyrian (1 .1 43. or .B%). 

A total of 49.645 students from non-English back- 
grounds were identified in Illinois as achieving 
below age or grade level in listening, speaking, read- 
ing or writing in English, based on local district as- 
sessment procedures. This figure, representing 
34.6% of Illinois' students from non-English back- 
grounds, is reflective of the students eligible for 
transitional bilingual education program services. 

The number of students participating in transitional 
bilingual education programs in Illinois totalled 
41.966. Sixty-nine downstate programs accounted 
for 6.B73 students, or 16.4% of the total; the Chica- 
go program accounted for 35.093 or 83.6% of the 
total students participating in programs. 

Of those students who were eligible for program 
services. Chicago and downstate programs served 
84.5% from non-English backgrounds. Those not 
participating in transitional bilingual education pro- 
grams were not included due to one of the following 
reasons: denial of parental permission, placement 
in another program of instruction deemed more ap- 
propriate for the child (such as special education), 
and attendance center location factors. A transition- 
al bilingual education program is mandated only for 
students from a non-English background who share 
a common language with 20 or more students in an 
attendance center. 

Students in Illinois' transitional bilingual education 
programs were identified and assessed for program 
participation, placement and exit based on 24 dif- 
ferent language proficiency and dominance tests. 
11 standardized reading tests and 7 standardized 
achievement tests. Twenty-two percent of all stu- 
dents (9.532) were administered pre- and post 
tests, while 19.841 or 47.3% were administered at 
least one language or achievement instrument. A 
list of instruments recommended for reporting 
transitional bilingual education student assessment 
data has been compiled by the State Board of Edu- 
cation and disseminated to local districts. District 
staff have participated in numerous inservice oppor- 
tunities related to testing and assessment. It is ex- 
pected that the number of students assessed will in- 
crease in subsequent years as technical assistance, 
inservice and district assessment procedures 
improve. 



Illinois transitional bilingual education progranis 
reported daily minutes of instruction. State data il- 
lustrate that these students spent an average of 33 
minutes in English as a second language instruc- 
tion. 108 minutes in native language instruction, 
and 1 50 minutes in English instruction. 

Positive achievement gains were reported for stu- 
dents participating in Illinois^ transitional bilin^^^^ 
'education pr^^ 

cy and dominance tests, downstate students were 
found to have raised their proficiency level by one 
category (on a language proficiency scale of 1-5). a 
considerable gain in one program year. Scores 
based on the Iowa Test of Basic S/r/Z/s. a measure of 
reading achievement, indicate that Chicago's transi- 
tional bilingual education students scored better 
than expected compared with English-speaking 
cohorts in compensatory programs. Furthermore. 
Chicago's students were acquiring English compre- 
hension and speaking skills while developing basic 
reading skills. 

A total of 7.236. or 1 7.2% of the total number of stu- 
dents participating in transitional bilingual pro- 
grams, exited from the program. These students left 
programs by successfully transitioning, dropping 
out of school, voluntary or involuntary program 
termination, or for other reasons. Exit rate, unlike 
successful transition rate, reflects the yearly student 
turnover. The downstate exit rate was found to be 
19.7%. while the Chicago exit rate was calculated 
at 17.0%. 



Transitional bilingual education programs in Illinois 
successfully transitioned 6.118 students. This 
figure represents a successful transition rate of \ 
14.6%. state-wide. Downstate programs successful- \ 
ly transitioned 8.1% of the students participating in 
programs, while Chicago successfully transitioned 
15.9% of their students. 

a. Testing Instruments: 

Language Proficiency and Dominance 

Implicit in the assessment process is a mecha- 
nism for decisionmaking regarding student 
identification and evaluation in the seventy 
State-approved bilingual programs. While the 
request for information needed for district as- 
sessment procedures did not specifically re- 
quire utilization of commercially developed 
testing instruments, all but five distrirts used at 
least one commercially developed instrument. 
Approximately 20% of all districts utilized 
teacher-made vocabulary, syntax, comprehen- 
sion or production tests. 



The commercially developed tests for assess- 
ment of English language skills were divided 
into three categories: a) language proficiency / 
tests, b) standardized achievement tests, and c) 
standardized reading tests. A list of test instru- 
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merits, by test code, was provided to districts 
and included in the instructions for completing 
the Annual Student Report. 

Of the thirty-three lant Jage proficiency tests 
listed on the instructior. sheets, twenty-three 
were utilized by the seventy districts. The fact 

cyTests were used complicated the procedure 
for determining language proficiency gains be- 
cause of the variability of the instruments. How- 
ever, of the twenty-three instruments utilized, 
six instruments accounted for students in ap- 
proximately 75% of the districts: 

Boehm Test Of Basic Concepts (BTBC) 
Functional Language Survey (^LS) 
Interamerican Tests 

James Language Dominance Test (JLDT) 
Language Assessment Scales (LAS) 
Peabody Picture Vocabulary Tesr (PPVT) 

Because language proficiency tests are not 
often used by general public school educators, 
a brief description as found in A Guide to As- 
sessment Instruments for Limited- 
English-Speaking Students (Reynolds & 
Sisson. 1978) follows: 

i. Boehm Test of Basic Concepts: 

A diagnostic instrument used to measure 
the child's mastery of concepts related to 
space, time, and quantity. There are 25 
pictorial items which are arranged in order 
of increasing difficulty. The examiner 
makes a brief statement about each item 
and asks the students to choose the picture 
which best corresponds. Students respond 
by marking answers in their test booklets. 
The test may be individual or group admin- 
istered and is appropriate for grades K-2. 
Approximately 20 minutes is required fo/ 
test administration. 

ii. Functional Language Survey: 

An individually adninistered survey of stu- 
dents' ability in comprehension ^nd pro- 
duction of English. It consists of a, total of 
15 questions in 3 sections: 1) Comprehen- 
sion, 2) Production/Repetition, and 3) Com- 
prehension/Production. The FLS is ap- 
propriate for 1st through 12th grades and 
takes /-1 5 minutes to administer. 



iii. Interamerican Tests: 

A norm-referenced instrument which as- 
sesses vocabulary and reading comprehen- 
sion skills. There are 3 levels: 80 items on 
Level 1, 110 items on Level II. and 125 
items on Level III. For Levels I and II. stu- 

dants .respond by marking ensyyers in t^^^^^ 

booklets. For Level III, students use a separ- 
ate answer sheet. This test is group admin- 
istered. AdministratiOfi time is 20-25 mi- 
nutes for Levels I anjd II and 41 minutes for ' 
Level III. The grade range is 1-13. 

iv. James Languag^bominance Test: 

A diagnostic ijr^strument used to measure a 
students' receptive and expressive vocabu- 
lary ability. /A total of 40 parallel items are 
included in Spanish and English. Students 
respond ;6rally. This test is individually ad- 
ministered and administration time is ap- 
proxiniately 10 minuses. The test is ap- 
propriate for grades K-1. 

V. Language Assessment ocales: 

A diagnostic instrum^jnt containing 100 
items designed to asses'j phonemic pro- 
duction and discrimination, vocabulary 
production, sentence comprehension, oral 
production skills, and a student's ability to 
use language to attain specific goals. In- 
structions are given orally, and item stimuli 
are either taped or pictured in the test 
booklet. Students respond orally or by 
pointing. The test is individually 
administered. A language arts supplement 
containing follow-up learning activities 
and language games related to each test 
item is available. The test takes 20 minutes 
to administer in English and 20 minutes to 
administer in Spanish and is appropriate 
for grades K-1 2. 

vi. Peabody Picture Vocabulary Test: 

An instrument used to provide an estimate 
of the subject's understanding of English 
vocabulary words. The examiner says a 
word for each of the 1 50 picture sets (4 pic- 
tures in each set) in the series, and the stu- 
dent responds by pointing to the correct 
picture. The test takes approximately 15 
minutes to administer and is appropriate 
for grades pre-K through adult. 
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THE ASSESSMENT OF LANGUAGE MINORITY STUDENTS: 
CURRENT TRENDS IN THE STATE OF ILLINOIS 

by 

Maria Medina-Seidner 
Illinois State Board of Education 



Background 

Illinois has actively supported bilingual education pro- 
grams for almost a decade. Beginning with $200,000 ap- 
propriated by the General Assembly in 1970-1971 to 
fund an experimental program in five Chicago schools, 
the state-funded experimental program (mandated 
since 1976) presently provides bilingual instruction in 
over 30 languages to 40,000 children of limited-English 
proficiency (LEP) In the Chicago public schools and B3; 
school districts throughout the state. The appropriation 
level in FY 80 is $1 6.5 milfion. 

As in most states mandating bilingual education, the Illi- 
nois law requires the establishment of a transitional 
bilingual program. Student participation in the program 
is limited to three years "or until such time as (the stu- 
dent) achieves a level of English language skills which 
will enable him to perform successfully in classes in 
which instruction is given only in English, whichever 
shall first occur" (School Code of Illinois. Article 14-C-3). 

Questioning the steady increase in the number of stu- 
dents enrolled annually m transitional bilingual educa- 
tion programs and the corresponding increase in the 
annual appropriation request, the General Assembly 
has begun to question the effectiveness of the program. 
Specifically, legislators want to know: 

1 Are students in bilingual education programs learn- 
ing English? 

2. Are students "transitioning" out of the bilingual pro- 
gram into all-English classes? 

3 How many students exit the program each year and 
for what reasons? 

Ther.e concerns led the Genaral Assembly to request an 
outside evcluation of the Illinois Transitional Bilingual 
Education Program. In June. 1979. L. Miranda and As- 
sociates presented their findings to the Illinois itate 
Board of Education. The evaluation determined Vhat 
bilingual programs were meeting their objectives and 
that between 1 7 percent and 24 percent of students en- 
rolled in bilingual programs exited the program each 
year The evaluation also pointed to the need to estab- 
lish a management information system to maintain 
complete, accurate, uniform, and comprehensive pro- 
gram data at the state level. 



In response to these recommendations as well as to the 
increasing demands for program accountability on the 
part of the General Assembly, the Illinois State Board of 
Education has undertaken a complete revision of its 
annual census procedures (including the determination 
of student eligibility and program participation in Transi- 
tional Bilingual Education Programs) and has developed 
statewide procedures for data collection and program 
administration. 

Student Assessment and Data-Gathering Procedures 

During the 1979-BO school year, the Bilingual Education 
Section. Program Evaluation and Assessment Section, 
Research and Statistics Section, and Data Management 
Section of the Illinois State Board of Education have 
worked cooperatively to develop and implemi3nt ne^ 
procedures for data collection and program administra- 
tion. These procedures consist of the following compo- 
nents: (a) Public School Bilingual Census, (b) Program 
Application, (c) Student Cumulative Record, and (d) 
Annual Program Report. (Sample forms are available 
from: Illinois State Board of Education, 100 North First 
Street. Springfield, Illinois 62777) 

Public School Bilingual Census. According to the 
School Code (Art. 14-C-3): 

Each school district shall ascertain not later than 
the first day of March, under regulations prescribed 
by the Supehntendent's Office, the number cf child- 
ren of limited English-speaking ability within the 
school district and shall classify them according to 
the language of which they possess a primary 
speaking ability and their grade level, age. or 
achievement level. 

The Public School Bilingual Census has Seen designed 
to identify all non-English language background stu- 
dents and to distinguish which of these students are 
eligible for program participation. Census forms are dis- 
tributed annually to all school districts in January and 
are due at the Illinois State Board by March 1. Each at- 
tendance center must fill out a form. Students are 
recorded according to language in one of three 
columns. Column A is provided for students with a non- 
English backj^i ound who are attending classes at the at- 
tendance center Column B is provided lor studer^ts 
reported in column A whose English proficiency level is 
below average in auial comprehension, speaking, read- 
ing, or writing in English as determined by district per- 
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sbiinel in accordance with a state-approved district as- 
sessment procedure. Column C is provided for students . 
reported in column A whose English proficiency level is 
equal to the state-approved norms or average. 

'■ !n order to conduct the census, each LEA must have de- 
veloped a district assessment procedure. This procedure 
must contain: 

(a) A description of the district procedure for the 
identification of students with a non-English 
background specifying the instruments 'or 
other assessment strategies used, including 
the individual(s) responsible for implementing 
the procedure and the training to be received 
by the person(s) who will perform the identifica- 
tion. For example, STUDENT IDENTIFICATION 
must include: PROCEDURE for identification of 
non-English background students. 
INSTRUMENT specification. WHO assesses stu- 
dents from non-English backgrounds. 
TRAINING of the assessor. 

(b) A description of the district procedures for eval- 
uating the English proficiency of students 
whose first or native language is English, spe- 
cifying the instruments and/or procedures to 
be used, including the person(s) responsible for 
the English proficiency evaluation and the time 
the evaluation will occur. For example. 
EVALUATION must include: PROCEDURE for 
f valuation of native English-speaking students' 
English proficiency. INSTRUMENT specifica- 
tion. WHO assefsses English language proficien- 
cy of native English speakers. TIME of 
evaluation. 

(c) The average English proficiency, performance, 
or achievement level by grade or age equivalent 
for students whose first or native language is 
English. For example, PROFICIENCY LEVELS 
must include: STATEMENT of local average 
proficiency levels for EACH age or grade level 
for native English-speaking children. 

This procedure is reviewed by the Bilingual Section and 
Evaluation Section staff at the Illinois State Board of 
Education. If the district assessment procedure is unsa- 
tisfactory, technical assistance is provided. Since this 
was the first year of this procedure, five regional techni- 
cal assistance workshops for district evaluation staff 
^were conducted prior to the census. These sessions 
helped districts not only in the pi-eparation of their as- 
sessment procedures, but also in improving their knowl- 
edge of assessment instruments. 

The new student assessment requirements also reflect 
the State Board's position as stated in their newly adopt- 
ed Policy Statement on Bilingual Education: 

The State Board of Education believes that the 
educational needs of eac/?student limited in English 
proficiency should be met. The Board shall approve 
the standards by which the district determines the 
Q eligibility of its non-English background students 

ERIC 



for transitional bilingual education programs and 
services. The required primary criterion for eligibility 
and successful program completion for each non- 
English background student shall be an acceptable 
measure of English proficiency as compared with 
peers whose first or native language is English. I 

The Program Application is prepared by each school 
d^^trict planning to conduct a bilingual education pro^ 
gf*am the following school year. Tt is due May 1. The Pro-- 
gram Application is designed to provide program de- 
scriptions at the attendance center leveL The number of 
students must correspond to the number given on 
census column B; otherwise, an explanation must be at- 
tached. The application provides the LEA with the op- 
portunity to fiequest that some eligible students be ex- 
empted from program participation. However, an expla- 
nation must be attachOc^ including a description of the 
needs assessment that vvas performed artd the instruc- 
tional program which will be provided instead of 
bilingual education. Information is recorded which in- 
cludes a personnel summary and a program summary 
by language group for each attendance center imthe 
LEA. The application also contains the district assess- 
hnent procedure. 

The Student Cumulative Record \s designed to provide 
local districts with a vehicle for collecting information 
about studenta at each attendance center. The Student 
Cumulative Record remains in the district and serves as 
the basis for required end-of-the-year reporting. This 
document has been designed to follow the student over 
a three-year period even if the student transfers to 
another school. Provisions are also made for a fourth or 
follow-up year. Recorded on this form are data on indi- 
vidual student progress in English, home language, and 
subject-matter skills. Important program information, 
such as minutes of English as .a second language (ESL) 
instruction per day. minutes of instruction per day using 
the native language, minutes of English instruction per 
day (excluding ESL). and total days present during the 
year, is also recorded. Also recorded are data on pro- 
gram entry, e.g. entry date, years in other programs and 
exit (e.g. exit date and exit code). The bilingual program 
exit codes are as follows: 

1. Transitioned: Student able to perform success- 
fully in an all-English classroom as determined 
by district exit criteria. 

2. Involuntary program termination: Student has 
moved or been promoted within the district to 
an attendance area which does not require or 
offer a bilingual program. 

3. . Voluntary program termination: Student has 

withdrawn from the program at the request of 
the parents. 

4. Dropped out of school: Student has dropped 
out of school, but still resides within the school 
district. 

5. Other: Student has either withdrawn from 
school and moved to another district or his/her 
status is unknown. 
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The Annual Student Repon is a computer-generated Z 
turn-around document which is submitted to the State 
Board of Education in July. LEAs are expected to 
compile student data which will provide a basi& for 
statewide evaluation activities to assess overall program 
effectiveness. Individual student information is reported 
by student ID number and includes scores on English 
language proficiency, instruments used, attendance 
and other programmatic information, e.g. the amount of 
time spent daily on ESL instruction, native language, 
and English language instruction. 

A comprehensive evaluation report based on informa- 
tion gathered from the Bilingual Census, the Program 
Application, and the Annual Student Report is to be pre- 
pared annually and disseminated to members of the 
General Assembly, local education agencies, institutions 
of higher education, parents, community representa- 
tives and other concerned citizens. The report will in- 
clude a descriptive section which will emphasize the 
unique programmatic characteristics of each bilingual 
program in IMinois and a quantitative section which wiM 
address these evaluation questions: 

1. What is the total number of identified LEP students ^ 
by language who were adequately and appropriate- 
ly served through state-funded transitional bilingual 
education programs during the preceding school 
year? In order to demonstrate the extent to which 
students were served or underserved, one or more 
of the following will be included: pupil/teacher 
ratio, full-time equivalency ratios, type of teacher 
endorsement program model type (self-contained., 
pull-out. etc.). 



2. What is the total number of identified LEP students 
by language who were underserved through state- 
funded Transitional Bilingual Education Programs 
during the preceding school year? 

3. What is the total number of identified LEP students 
by language who were not served through state- 
funded Transitional Bilingual Education Programs 
during the previous school year? 

4. What entrance criteria are utilized to determine stu- 
dent participation in the Transitional Bilingual Edu- 
cation Progr-am? 

5. How many students left the Transitional Bilingual 
Education Program during the preceding school 
year? 

6. Of the students leaving the Transitional Bilingual 
Education Program, what were the reasons? 

7. How many students were exited (successfully tran- 
sitioned) and found able to perform successfully in 
an all English classroom as determined by district 
exit criteria? 

8. To what extent do students in the Transitional 
Bilingual Education Program show evidence of 
progress in English language skills? 

9. How many minutes per day of ESL instruction did 
students in Transitional Bilingual Education Pro- 
grams receive? 
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POLITICAL EXPEDIENCE OR EDUCATIONAL RESEARCH? 



An Analysis of Baker and deKantor's 
Review of the Literature of Bilingual Education 



by 

Stanley S. Seidner, Ph.D. 
National College of Education 



Introduction 

Baker and deKanter's study appeared in a flurry of con- 
troversy, as at least one newspaper declared ih bold 
headlines "Studies Disavow U.S. Focus on BilinOtial Edu- 
cation"* (Washington Post September 29, T981). The 
staff writer began the article by stating "Controversial 
new Department of Education studies" cohcluded that 
"the federal government no longer should focus its ef- 
fort? to aid non-English speaking students primarily in 
isid bilingual education because there/i$ little evidence 
those progrannis work..." Ub/d» p. A6)yCunous to deter- 
mine iif this was yet another case 0/ misrepresentation 
by the media of sound educationa/research, I obtained 
a copy of the study (Baker and deKanten 1981). After a 
careful analysis, I sadly conclude that the study reflects 
a trend of prostituting educational research for possible 
political and economic expediency. 

;,The problems which plague the potential development 
and execution of a research design are well-known and 
too extensive to sustain a worthy recapitulation (See 
Best, 1977; Stanley, 1967: Thorndike, 1971, 1968; EbeL 
1967: Popham, 1975, Ennis, 1973, among others). The 
actual development and implementation of a research 
design according to scientific canon implies the greatest 
possible reduction of bias and a "fair hearing" of data. 
Difficult as it becomes to do so with primary data, the 
application of this type of approach is more so in ap- 
proaching data from a secondary perspective. A hidden 
agenda of a political nature further compromises this 
approach to the point where the attempt at rigorous re- 
search metamorphoses into statements of support for 
public, policy recommendati ')ns. Examining with care 
the authors' methodology, I am convinced that Baker 
and deKanter's review is an example of this process. 
This paper presents an analysis of theirreview, with par- 
ticular attention given to the authors' methodology and 
conclusions. We begin with an overview of their re- 
search design. 

This paper was written as an iridependent analysis of a rebort 
ployed by the Office of Planning, Budget and Evaluation, L 



General Research Approach 

Baker and deKanter pose two research questions which 
they claim are "derived from the principal intent of 
Federal policy..." (Introduction, p. 1)."* 



1. Does transitional bilingual education lead to 
better performance in English? 

2. Does transitional bilingual education lead to 
better performance in nonlanguage subject 
areas? 

The authors state that they intended to limit the discus- 
sion of their review to the two questions (ch. 1, p. 8). 
They contend to have reviewed "more than 300 docu- 
- ments," of which 150 were program evaluations. Most 
of the studies were Title VII evaluations, supposedly rep- 
resenting every region rjv the country. They also claim 
that studies were identified by means of an "ERIC 
search* by consultation with experts in the field, from 
prior reviews, and from lists of studies." Although they 
insist on having "covered the major studies." Baker and 
deKanter acknowledge that they were unable to obtain 
copies of pre-1978 evaluations from the Office of 
Bilingual Education. Nevertheless, the authors express 
the belief that their review is the most comprehensive 
undertaken to date. Only 28 studies were judged by the 
authors to apply to their concerns and methodological 
criteria. In essence* their cpnclusions and recommenda- 
tions for national policy are based upon these 28 select- 
ed studies. It is unfortunate that the authors failed to 
fully develop the application of their criteria by briefly 
citing their reasons for rejecting each of their reportedly 
examined 300 or so studies. Although they claim limita- 
tions of time, it becomes difficult to determine the quali- 
ty of attention Baker and deKanter allegedly expended 
in their total review. Their criteria for rejection included: 
•'\ 

i. Rejected studies lacked random assignment to 
treatment and comparison groups* as well as 

by Keith A. Baker and Adriana A. deKanter, who are em- 
.S. Department of Education. 
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control for potential initial differences between 
groups. 

ii. Studies failed to address the issues which 
Baker and deKanter claimed were under 
consideration. 

iii. The studies employed a norm-referenced 
. design. 

iv. Studies used comparison of post-test scores 
only with nonrandom assignment. 

V. There was a reliance upon school-year gains 
for the program group without a control group. 

vi. There was a reliance upon grade-equivalent 
scores. 

A number of criteria (for example, vi) can be debated 
with arguments for and against their inclusion. On the 
surface, it would seem as if the authors are adhering to 
rigorous research methodologies. However a closer 
look shows otherwise. An analysis of Baker and deKan- 
ter's paper has elicited the following concerns: 

1 . The first concern is the authors' improper definitions 
of terms. Through their glossary of selected terms 
and some explanations in the text. Baker and deKan- 
ter attempt to set the frame of reference. which 
would aid them in approaching the research ques- 
tions. Serious problems develop, as we will present- 
ly see, with a number of terms. The authors present 
their definitions of terms such as Transitional 
Bilingual Education (TBE), English as a Second Lan- 
guage (ESL), Structured Immersion, and Submer- 
sion (Baker and deKanter, Introduction, pp. 2-3; ch. 
1, pp. 5-8). In their methodology of creating opera- 
tional definitions, the authors would have to 
assume that their terms are equally applicable to 
the selected studies. It would appear that Baker and 
deKanter failed to check the validity of their defini- 
tions with the reality of practices in programs 
which they selected or to correct for any potential 
inconsistencies. (Their definitions fail to account for 
such differences as time and treatment, for exam- 
ple.) The authors' definitions of TBE, for instance; in- 
forms us that subject matter "is at least partially 
taught" in until a successful transition can be 
made to (ch. 1, p. 6). Unfortunately, the vagueness 
attached to their interpretation of the essential 
nature of this definition is useless for rigorous 
regard and consistent measurement approaches. 
Balasubramonian, et. al. (1973), may have addressed 
a number of schools with models corresponding 
more readily to maintenance bilingual/bicultural 
education. (Illinois implemented their mandate for 
TBE in 1976). Baker and deKanter fail to reconcile 
potential differences in their definition of such varia- 
bles as time and treatment from this study and 
others, like Carsrud and Curtis (1980), Cohen 
c (1975), Covey (1973), Kaufman (1968), or Ramos, et 
al. (1967) among others. 

O 
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A similar problem arises with Baker and deKanter's 
definition of immersion. The authors overlook that 
the St. Lambert project (Lambert and Tucker, 1972) 
was designed to promote functional bilingualism . 
through the vehicle of a home-school language 
switch (also Bruck, Lambert and Tucker, 1974). It 
would also appear that the authors of the Canadian 
studies, as well as others associated with, the pro- 
gram, have defined immersion in terms of bilingual 
education, with the goal of promoting dual lan- 
guage facility (for example. Swain, 1972; Tucker, 
Lambert and DAnglejan, 1973; Lambert, .1974; 
Cohen and Swain, 1976; Swain and Barik, 1973). 
Here again, faulty research design is apparent in 
Baker and deKanter's reconcilation of their defini- 
tion of "structured immersion" and their selection of 
studies (ch. 1 , p. 6, ch. 2, pp. 62- 72). Again, there is 
no evidence that the authors considered differences 
in approaches to the implementations of programs 
by Lambert and Tucker (1972), Barik and Swain 
(1975), Barik, Swain and Nwaninobe (1977), and 
Pena-Hughes and Solis (1980). If the authors 
desired to propose an operational definition, they 
were unsuccessful in convincingly describing 
procedures. 

Another instance of potential problems arises with 
the authors' definition of ESL There is no evidence 
to indicate, here again, that Baker and deKanter 
took under consideration similar concerns. Had 
they considered the distinctions in studies between 
students learning English as a second language 
from the availability of ESL methodologies? Which 
ESL methodologies, if any, are under consideration 
(i.e. behavioral, cognitive, eclectic)? Again, we are 
given no indication. In short summarization. Baker 
and deKanter's approach to their definitions is less 
than acceptable as a durable component of their re- 
search design. Young and Comtois relate the follow- 
ing (1979): 

Research utilization is an extraordinarily com- 
plex phenomenon. Yet it can be analyzed like 
any other social phenomenon through a pro- 
cess of conceptualizing variables, defining 
them, and operationalizing them in a research 
setting. And as with other phenomena, clear 
definitions are necessary to make possible com- 
munication on the topic. Without such clarity, 
one cannot make sense out of the otherwise 
conflicting assessments of such things as the 
extent of program evaluation use and what fac- 
tors seem associated with that use (p. 64). 

The authors' definitions could not fail to influence, 
in turn, the extent of their own research (contingent 
upon what Caolan, 1977 believed to be the concep- 
tualizations of use and research; also Babbie, 1979, 
and Weiss, 1977, 1978, Thorndike and Hagen, 
1969). It would then follow that the remainder of 
Baker and deKanter's methodology might also be 
subject to scrutiny. 
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2. A second concern is the authors' attempt to play 
the role of rigorous researchers. Baker and deKanter 
apparently feel in a general sense that technically 
flawed studies are worse than no studies at all. 
There is a general lack of consensus regarding eval- 
uation methodology. Some evaluators adhere to 
principles of rigorous research, while others con- 
tend that such processes exclude, in their rigidity, 
approaches to gathering other information (See 
Young and Comtois. 1979). The application, in 
general, of a rigorous research approach to a secon- 
dary study becomes problematic. This approach to 
a review of literature subordinates, in premise, the 
methodologies and findings of selected and rejected 
studies to strict adherence of scientific canon. The 
principle in itself would appear somewhat palatable, 
if Baker and deKanter had taken the time and care 
to equitably apply their criteria. Their approach 
would seem convenient in eliminating a number of 
studies which would potentially refute their argu- 
ments. Consider the statement made by the primary 
researcher of a rejected study that Title VII Basic 
Programs "are primarily geared to equalize educa- 
tional opportunities." Accordingly, such programs 
"are not designed to. nor controlled by the strictest 
canons of scientific research" (Leyba. 197B. p. 6). 
The resulting few studies which survive the authors' 
gauntlet become too paltry in number and diverse 
in methodology and research intent to even afford 
serious conservation of Baker and deKanter's re- 
search questions. One of the basic tenets of educa- 
tional research was expounded by Thorndike 
(1973): 

We must always remember that any test, or 
any other type of behavior observation repre- 
sents only a limited sample from some domain 
or behavior It represents the domain imperfect- 
ly, and the scoje it produces is only an approxi- 
mation to the score that the individual would 
get for the whole domain. . .(P.5B) 

The observations of individual researchers represent 
limited domains imperfectly and should be viewed 
within the stated designs of the particular studies. 
The face-value argument of the authors might run 
along the lines that policy making dictates current 
answers based Upon available data. The role of 
rigorous researcher, as it is selectively applied by 
Baker and deKanter. is ludicrous by virtue of their 
failure to recognize differences of research purposes 
and methodology among studies. (We will see evi- 
dence of this In the following subsections.) 

3 An outgrowth of the preceding subsection is a con- 
cern with the authors' methodology in attempting 
to reconcile the selection of data to their research 
questions. Baker and deKanter select Kaufman's 
study (196B). for example, to test the effectiveness 
of TBE presumably against submersion. The pur- 
pose of Kaufman's study .is generally conveyed by 
the title. "Will Instruction in Reading Spanish Affect 
Ability in Reading English?" There is ho evidence to 
suggest that the study was designed to test the 
general effectiveness of TBE in comparison to the 



author^' definition of submersion or to specifically 
answer Baker and deKanter's research questions. 
An analysis of their secondary review, however, 
shows attempts by the authors to "bend" the data 
to fit their research questions. At best, the collection 
of studies indicates a review of literature which 
touches upon their research questions by selective 
interpretation and inference — not by substantial 
research which has been collected by original and 
careful design. For example. Baker and deKanter 
cite one group in Kaufman's study (group A), which 
received more instruction in Spanish than a com- 
parison group, as having 'showed no English read- 
ing improvement'' (p. 18. By their own admission, 
more time was alloted to primary speakers of Spa- 
nish in their native language than in English.) It 
would seem that Baker and deKanter classify Kauf- 
man's study within their definition of TBE because 
of the use of Spanish. Why would the authors 
prefer this categorization, instead of perhaps immer- 
sion? One would appear as arbitrary as the other A 
less generous reviewer might categorize Baker and 
deKanter's attempts as "charlatanic." rather than re- 
flecting misinformation. 

It is also intriguing that Baker and deKanter praise 
as a strength of Kaufman's study "the use of covari- 
ance to control for pre-existing differences" in 
verbal and non-verbal 10. among other factors 
(p. IB). Discerning the authors' rationale is further 
confusing, given their citation on 1.0. testing and 
"innate language ability" (ch. 1. p. 16. f. 9: also see 
their definition of "correlation", p. 2). The authors 
are too preoccupied with statistical tests to consid- 
er any valid relationship of intelligence testing to 
Kaufman's primary intentions or to their own re- 
search questions. (See my concerns bciow on statis- 
tical misuse.) If the authors contend thare is a valid 
relationship to either, they fail to provide them. If 
there isn't, why would Baker and deKanter mention 
the utilization of covariance as a strength of the 
study within an irrelevant framework? The Balasu- 
bramonian. Seelye and DeWeffer study (1973) is 
another example of misapplied research. The origi- 
nal intent of the three authors is reflected in the title 
of their study. "Do Bilingual Education Programs 
Inhibit English Language Achievement? A Report 
on an Illinois Experiment." Here again, a distinction 
is made between the purposes of Balasubramonian. 
Seelye and DeWeffer's study and Baker and deKan- 
ter's loose interpretation. The target programs were 
bilingual/bicultural. corresponding in many in- 
stances to a maintenance philosophy. Transitional 
Bilingual Education as a State mandate in Illinois, 
was not implemented until 1976. Baker and deKan- 
ter claim, as a shortcoming of the study, a failure to 
present data "on progress in nonlanguage subjects." 
This presentation of data, however, was not the in- 
tention of Balasubramonian. Seelye and DeWeffer 
since their study focused on a language concern. 
One could question the motives of Baker and deKan- 
ter in setting up a possible "straw dog" to knock 
down. Of greater concern, however, is the careless 
methodology employed by the authors in present- 
ing pre^mably important research issues. 
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Baker and deKanter continue this trend in their se- 
lection process, posing serious questions as to the 
soundness of their methodology (for example. 
McConnell. 19B0a, 19B0b: Ramos. Aguilar. and 
Sabayan. 1967).^ A more detailed analysis Of Baker 
and deKanter's selection processes may provide an 
inquisitive graduate student with a research topic 
for a Master's thesis. 

4. • One of my strongest concerns is Baker and deKan- 
ter's apparent selective application of their criteria. 
It would appear that Baker and deKanter are 
prophetic in their observation that an argument 
might be made that they "applied arbitrary criteria 
and personal judgment" (ch. 1, p. 16). The authors 
at first state what appears on the surface to be rigor- 
ous criteria for selection, and then mention in 
almost the same breath that "the criteria for meth- 
odological soundness were applied in a way that 
recognized that a design weakness in one area can 
be compensated by a strength in another area" 
(Introduction, p. 4). I fail to see how the authors 
apply scientific canon in their methodology of 
equating one criterion as equal in weight to anoth- 
er.^ Take, for example, the application of their criteri- 
on of grade-equivalent scores. In essence, the grade 
equivalent of a particular score could be considered 
the grade level of the norm group, its median the 
same as the raw score. (For a discussion, see Beggs 
and Lewis. 197&.) Baker and deKanter state that the 
use of grade-equivalent scores was a criterion for 
rejecting studies (ch. 1. p. 15). The issue is not one 
of why this criterion was selected (although argu- 
ments can be made for and against), but how it was 
applied. Rejected studies, conducted by Leyba 
(197B). Trevino (1970) andOlesini (1971). employed 
grade-equivalent scores. Yet Baker and deKanter 
waive the application of this criterion for Kaufman's 
study (196B) and seem to feel that its questionable 
experimental design sufficed to overcome the short- 
coming. This selective application of their criteria 
appears to be a flaw in Baker and deKanter's meth- 
odology and can be cited by other examples. 

The authors supposedly also rejected studies such 
as Cohen (1974) and South San Francisco, which 
examined gains without any control groups. The 
selective application of criteria becomes evident 
with Baker and deKanter's acceptance of Stern's 
study (1 975) which also failed to include any control 
group. Here the authors seem to feel that Stern's 
employment of "some longitudinal analysis and ad- 
justment for the effects of nonrandom selection by 
analysis of covariance" is sufficient compensation 
(Baker and deKanter. ch. 2. p. 5). Their arguments 
are unconvincing, at least to this researcher. The 



trend continues with the criterion of rejecting stud- 
ies which utilized norm-referenced designs. Baker 
and deKanter argue that the nature of the learning 
curve for limited-English proficient students is un- 
known, and that a potential phenomenon with 
achievement tests yields higher gains than are 
actual true measures of performance (ch. 1 p. 13). 
We find that a number of rejected studies, partic- 
ularly St. John Valley (19B0). Corpus Christi (19B0a. 
19B0b.) and Arce (1979) employed a norm- 
referenced design. Nonetheless, the authors waive 
this criterion for Stern (1975). Ames and Bicks 
(197B). This kind of selective application of griteria 
can only serve to destroy the credibility of Baker 
and deKanter's methodology in their professed 
scientific review of the literature. 

5. Another of my strong concerns in observing the au- 
thors' methodology is the way Baker and deKanter 
apply a selective secondary analysis of data to sub- 
stantiate their arguments. Along these lines of 
thought, the authors: a) present incomplete and 
selective data (errors of omission) and b) present a 
biased interpretation of studies which do appear. In 
their review of Covey's study (1973). for example, 
the authors fail to acknowledge his conclusion that 
Mexican-American students who were enrolled in a 
bilingual education program achieved significantly 
higher in the academic disciplines of English and in 
reading than those enrolled in regular school pro- 
grams. (Covey also attributed a favorable attitude 
toward self and others to those Mexican-American 
students enrolled in the bilingual program.) Baker 
and deKanter seem to go out of their way to accen- 
tuate problems in interpreting their selective results 
of the study (see their discussion, ch. 2. p. 5). If the 
study passed their criteria for selection, why would 
they not want to present the data as is? They offer 
"three competing explanations as to why the pro- 
gram worked." which are theoretical and do not de- 
tract from the actual results of the study. They also 
seemed to take the time to have contacted Covey 
and conclude from the communication that a less 
than sufficient , number in their estimation 
"achieved a sufficiently higher level of English profi- 
ciency to be mainstreamed (p 5.)" While this com- 
munication is utilized to clarify their point, why 
hadn't the authors exercised foresight to clarify 
their own expressed puzzlement that Covey's study 
"is very uninformative as to the nature of the pro- 
gram" (see ch. 2. p.4)? Furthermore, how could 
Baker and deKanter presume to classify the study in 
support of one of their theses without knowing the 
nature of the program? A similar pattern emerges 
with the authors' analysis of Skoczylas' study 
(1972). Again. Baker and deKanter declare that 
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bilingual instruction "did not lead to lesser English 
performance (p. 31). They overtook Skoczylas' con- 
clusion that students who were instructed bilingual- 
ly were learning Spanish and English simultaneous- 
ly without apparent difficulty and gained a benefi- 
cial "transfer effect" from language to language 
(1972). Instead of continuing with their own analy- 
sis. Baker and deKanter offer a lengthy extract from 
an evaluation of Skoczylas' study conducted by the 
National Institute of Education. If the authors 
accept conclusions regarding insufficient data ^to 
determine continued significant differences be- 
tween control and experimental groups, they are 
not justified in utilizing this data to justify their 
arguments. 

The subjective interpretation of the authors seems 
to be emphasized by their treatments of Danoff, 
Coles, McLaughlin, and Reynolds' study (1977-7B: 
referred to as the AIR study). Baker and deKanter 
devote almost the entire part of their discussion of 
the study in defending it (ch. 2, pp. 57-60) against 
critical reviews by Cardenas (1977) Gray (1977), 
and O'Malley (197B) instead of presenting an objec- 
tive overview and analysis. The authors fail in one 
representative instance to substantially dispel the, 
argument made by Gray (1977) that the AIR study 
failed to recognize differences among programs 
and approached bilingual education as undifferen- 
tiated and uniform. Baker and deKanter drop the 
role of researchers and answer that the criticism 
does not "acknowledge the needs of policy makers 
to make informed decisions based on representative 
data" (ch. 2, p. 5B). Another implication of the au- 
thors is that the data is representative. Still, they fail 
to pay heed to their own acknowledgement that the 
subsample of Danoff's "reanalysis" (1978) was not 
totally representative of the original sample. Also, 
the lack of comparative classrooms for the grade III 
cohort of Title VII children limits the generalizability 
of results (as well as posing a dilemma for Baker 
and deKanter in their application of criteria: Danoff, 
197B). Moreover, by rejecting studies critical of AIR 
(such as those cited above) and employing authors 
in support of the study (Rossi, 1979 and National In- 
stitute of Education, 1979), the authors seem to 
have a political agenda. The reader may find it more 
than coincidental that Baker and deKanter em- 
ployed a similar review by the National Institute of 
Education (cited above) in criticism of Skoczylas' 
study. Baker and deKanter neglect to mention that a 
number of individuals involved in the AIR study 
withdrew from the project and later wrote critical 
reviews (Anas. Delgado, DePorcel, and Irizarry 
1977, also Gray. 1978). 

In still another instance, the authors categorically 
conclude in Cottrell's study (1971) that "the pro- 
gram effect was probably underestimated by the 
analysis of covariance," since "students from the 
comparison schools were historically known to out- 
perform students from the project sphools..." (Baker 
and deKanter/ 19B1, ch. 2, p. 44). First, there is no 
statistical evidence offered to support their conten- 
^ tion. Second, the authors appear to selectively di- 
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verge from their absolute interpretation of covari- 
ance as a redeeming factor and which they apply in 
other instances (see subsection on statistical appli- 
cations below). Another illustration of Baker and 
deKanter's biased methodology bccurs with their 
examination of Cohen's study (1975). The authors 
conclude that they found "the programs' effect on 
English development to have been neutr$l to a little 
negative with mixed results in arithmetic" (Baker 
and deKanter, 1981, ch. 2 p. 52). This is hi^htynues- 
tionable, considering that bilingual students, for 
example, showed greater gains than d^ the com- 
parison groups regarding communicatii/e ability in 
English. The bilingual group began, "Substantially 
below the comparison group, so tha< their gains 
brought them even with the comparison students" 
(Cohen 1975, p. 161). The authors also neglect to 
mention that the follow-up Level I group seemed 
overall to be roughly equivalent to the comparison 
group in their development of language skills in En- 
glish. Here and in other instances. Baker and deKan-' . . 
ter failed to present a balanced presentation and to 
reconcile the data in support or rejection of Cohen's 
original hypotheses. In terms of academic aptitude, 
bilingual students scored better than comparison 
students on one level, and equally on two other 
levels. In mathematics, bilingual students performed 
as well as students in the comparison group and 
better on at least one level. It is interesting that the 
authors failed to review these findings among 
others. These examples are but a few which illus- 
trate Baker and deKanter's selective analyses. 

6. Another concern is the authors' sense and selective 
application of statistics. Baker and deKanter, for in- 
stance, acknowledge the validity of arguments 
which criticize analysis of covariance in failing to 
overcome difficulties posed by nonrandom selec- 
tion (ch. 1, p. 10: also see for a discussion in the 
use of the process, Elashoff, 1969, Elashoff and 
Snow, 1970: Winer, 1971; and Williams, 1979). In 
studies potentially supportive of the authors' argu- 
ments. Baker and deKanter stress analysis of covari- 
ance as a strength without further discussion. Note s 
their treatment of the Moore and Paar study (1978). 
Baker and deKanter cite the use of arialysis of 
covariance as a strength and conclude that the 
study, therefore, had "better statistical control than 
many studies having nonrandom assignment" (ch. 
2, p. 38). However, Moore and Paar (1978) noted the 
weakness in their use of covariance to adjust for 
pretest differences (whereby, the process syste- 
matically underadjusted for initial differences be- 
tween groups). The warning then, was that the re-^ 
suiting scores of nonbilingual classes should be 
terpreted with caution. In contrast. Baker ap^i 
deKanter approached one study which tended..0ri^ 
face value to discredit their arguments, with a less'" 
than balanced treatment of the researchers' applica- 
tion of the statistical process (see their treatment of 
Cottrell, 1971). 

Another curious example is the authors' treatment 
of the Skutnabb-Kangas and Toukomaa study 
(1976), Baker and deKanter classify this research 
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within their chapter of rejected studies. Neverthe- 
less, they take pains to apply a statistical analysis. 
supposedly to examine the extent to which the data 
support the use of Level 1 instruction with language 
minority children (Baker and deKanter 1981. ch. 3. 
p. 9). At first glance.' the application may appear to 
an experienced statistician to be a "cook-book" anal- 
/ ysis. orignating out of some program such as SPSS 

(Statistical Package for the Social Sciences). The 
very application of statistical tests by the authors is 
questionable, given the potential incompleteness 
and uncertainty of available raw data. Therefore, 
secondary presentation of data in the form of a re- 
: search report portends possible bias in the statisti- 
cal approaches of the reviewer. In reference to 
Baker and deKanter's statistical applications. I am 
at^a toss to see some of the significance and inter- 
pretMion of their approaches. It is unclear as to how 
(anil why), for example, the authors applied both" 
Spearman's rho (r ) and Pearson's (r). Spearman's 
rank-order correlation coefficient (non-parametric) 
is applicable to ordinal-level variables, while Pear- 
son's product-moment correlation coefficient is 
oriented toward pairs of interval-level variables (for 
a discussion, see Siegel. 1956. Guilford and Fruch- 
ter. 1973). Baker and deKanter provide little or no 
indication as to what was correlated and why. let 
alone the appropriateness of their application and 
iriterpretation. 

In interpreting their data, the authors appear to 
selectively diminish the values of their Somer's 
d. and gamma formulations. Instead of presenting a 
balanced view of the data, they diminish the impor- 
tance of these tests, which produce results possibly 
contesting their arguments. Why. then, would 
Baker and deKanter want to employ potentially infe- 
rior statistical tests in their analysis of the 
Skutnabb-Kangas and Toukomaa study (1976)? 

7. My final concern is with Baker and deKanter's ap- 
parent political agenda in the guise of rigorous 
educational research. Perhaps the authors are una- 
ware of their own dilemma in playing the simultane- 
ous roles of policymakers and rigorous researchers. 
The dangers of such a synthesis have been voiced 
by Patton (197B) in his statement: 

The traditional academic values of many social 
scientists lead them to want to be nonpolitical 
in their research. Yet. they always want to 
affect government decisions. The evidence is 
that they cannot have it both ways (p. 46; also 
Atkins and House. 1981). 

Very powerful evidence suggests that Baker and 
deKanter indeed provided a political agenda, appar- 
ently aimed at discrediting the bilingual education 
movement in the United States. The premise is 
simple and logical. If you can discredit the research- 
base of a particular discipline, you then destroy the 
substatiatiort for its raison d'etre. What is not 
known is the level of independent activity or 
pressured compliance under which Baker and 
deKanter labored. Let us start with the title page, 
where Baker and deKanter claim their review "does 
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nqt represent the official position of the U.S. Depart- 
nient of Education." Yet. on page 1 of their introduc- 
tion, they state that "the investigation was begun at . 
the request of the White House Regulatory Analysis 
and Review group for an assessment of the effec- 
tiveness of bilingual education." Both Baker and- 
deKanter work in the Office of Planning. Budget anrd 
Evaluation. The Deputy Undersecretary of Education 
for Planning and the Budget. Gary Jones, appeared 
to injudiciously "leak" information from the report 
(Baker and deKanter's) to the public media. A 
former member of the Fairfax County School Board. 
Jones, was quoted by a leading newspaper as de- 
scribing ESL as preferable to bilingual education in 
the schools (Fairfax County). "We were able to 
prove." he reportedly stated, "our approach was 
every bit as effective as theirs" [The Washington 
Post Sept 29. 1981. p. 16). It should be remem- 
bered that, during the past year. President Reagan 
cited the Fairfax County program as a model, while 
criticizing bilingual education (y\Mfl£ 1981). These 
developments challenge the credibility of Baker and 
deKanter's attempt to portray their assessment as 
unbiased or independent of political influence. 
Additional evidence from an analysis of their 
review lends greater credence to this assessment. 
Baker and deKanter seem unable to manipulate 
their own methodology and must place a flawed 
Fairfax County Public Schools study conducted in 
1980 in the category of rejected studies (ch. 3. pp. 
28-30). Yet. Baker and deKanter make sure that the 
reader notices that the Fairfax County program is 
"very successful when looked at in terms of gain 
over the school year (p. 30). Further analysis brings 
us to the logical second tier of a political agenda. 

If one destroys the substantiation for a movement, 
one has to provide another research base in support 
of arguments. My analysis suggests that Baker and 
deKanter have "dusted off" the aforementioned AIR 
study in an attempted resurrection. At least one 
self-proclaimed "neo-conservative" researcher 
(Miller. 1981) used the AIR study to substantiate 
the evils of maintenance bilingual education (p. 88: 
which conflicts with Baker and deKanter's attempts 
to classify it under TBE). Why. for example, would 
they take the time to place Troike's revievy (1978) 
within their category of rejected studies? Troike's 
work is not an empirical study, such as the ones 
Baker and deKanter claim to analyze, but a critical 
review of research. However. Baker and deKanter 
overlooked this consideration, devotipg time to 
question Troike's analysis while laudlrfg the meth- 
odology employed by the Amehcaj/ Institutes for 
Research in their other reports Mere Baker and 
deKanter conclude, "we have more confidence in 
the validity of the AIR study prepared for the JDRP" 
(ch. 3. p. 24). As mentioned abofve. the authors ap- 
proached their analysis of ther 1977-78 AIR study 
with an examination and refutation of criticisms. 
This attempted rehabilitation was reinforced by 
rejecting such studies as/Leyba's (1978). which 
were critical qf AIR (Alsc/ see Nickel. 1979). The 
political agenda is reflected as well in additional 
comhnents which are icited above in preceding 
subsections. / 
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Some Observations on Conclusions 

Baker arid deKanter's faulty methodology, beginning 
with ill-cOnceived definitions and criteria, inevitably re- 
sults in thb lack of reconciliation between their conclu- 
sions and stated research questions. The best that the 
authors c^n offer are qualified general observations 
which fail to address their originally postulated research 
questions, iln actuality, the two research questions 
raised by jBaker and deKanter have remained unan- 
swered. Agjaia this is not a surprising outcome, in light 
of the auth(l)rs' methodology. They list four categories of 
conclusion^ which are excerpted as follows, in the form 
of the authdrs' subtitles: 

1. Special programs can improve achievement in 
language minority students. 

2. The Federal Government should not place ex- 
clusive reliance on transitional bilingual 
education. 

3. Federal policy should be flexible. 

4. Implroved bilingual research and program evalu- 
ations are needed (Baker deKanter. 1981. ch. 4. 
pp. i 1-6). 

Under category 1. studies are cited to indicate "that spe- 
cial prografns designed to overcome language difficul- 
ties in schools can improve the achievement of lan- 
guage miriority students." The authors add that "this 
conclusior^' says nothing about the effects of any partic- 
ular mstru(j:tional approach" (p. 4). The statement is con- 
sistent wi|h results of individual studies reviewed by 
the authors in their selective processes (McConnel. 
1980a. I^Ob; Zirkel. 1972; Covey. 1973; Plante. 1976; 
Lambert and Tucker. 1972; Malherbe. 1946; and Cohen. 
1975. anrpng others). It would then follow that there is 
insufficiefit evidence to warrant the allegation of inef- 
fectiveness of transitional bilingual education, which 
poses a /dilemma for the authors in their second catego- 
ry (see k. above). A logical outcome would be that the 
federal/government should not place exclusive reliance 
on any program. The original so-called Lau guidelines 
(whicn still appear to be in effect) delineate programs as 
options, according/to particular conditions. Title* VII 
/legislation addresses another set of conditions for fund- 
ing. Baker and del^anter are unconvincing in their argu- 
ments and analysis that other options are suitable sub- 
stitutes for the /framework of TBE. along the lines of the 
experimental ^design they themselves purport to have 
followed. They are less so in stating that their analysis 
has "implicaftions beyond the Federal level." because a 
number of/states have followed the lead in developing 
TBE programs for language minority students. The au- 
thors al^o conclude within this category that their 
selectecTfmdings "do not add up to a very impressive 
case for the effectiveness of transitional bilingual educa- 



tion" (Baker and deKanter. 1981. ch. 4. p. 5). However. I 
have found Baker and deKanter's analysis to be one of 
the poorest and most biased research approaches to a 
review of literature for the stated ' observations in 
preceding subsections. 

Category 3 of the authors' conclusions assumes inflexi- 
bility on the part of the federal government regarding an 
emphasis upon "transitional bilingual education to the 
virtual exclusio'i of alternative methods of instruction." I 
fail to see how this premise is substantiated by the au- 
thors' approach, or. for that matter, reconciled to their 
research questions. No criteria have been presented 
and analyzed relevant to a definition of the methods of 
measuring flexibility or inflexibility. Since their study 
does not contend to be an analysis of federal policy, but 
supposedly an empirical approach. Baker and deKanter's 
conclusions are highly unsubstantiated and irrelevant. 
A similar statement by the authors that TBE has been 
found "ineffective and harmful in other places" also fails 
to find justification. (Their analysis brings to mind the 
reasoning of studies over a half century ago which at- 
tempted to link bilingualism with retardation.) 

The authors' "key to successful teaching in the second 
language." namely that subject content should not out- 
distance language, would appear at first logical. Howev- 
er, since the rate and quality of individual language ac- 
quisition is variable, the learning-rate of subject content 
area would be restricted to that particular medium of 
communication. The authors fail to contend with the 
reality that second language acquisition will not ocbur 
through osmosis and that a necessary time element, 
which varies from individual to individual, will impede 
the rate of subject content acquisition in quantity and 
quality. To coin Baker and deKanter's term, "common- 
sense observation" tells us that it is pedagogically 
sound to teach students subject matter in the known 
language where the symbols convey information in an 
understandable fashion. It therefore would appear that 
Baker and deKanter have promoted their political expe-. 
dient. with little concern as to what is educationally 
proper I have no argument against the prq/nise of expe- 
rimenting in as many promising areas as possible. How- 
ever. Baker and deKanter's category 4 (see above) goes 
against their inference that TBE has been tried and 
proven to be one way or another Their call for "im- 
proved bilingual research and program evaluations" in- 
dicates the need for further investments of Federal dol- 
lars in this area before any kind of empirical-based con- 
clusions can be drawn I will agree with Baker and 
deKanter's observation that bilingual education "in- 
volves many complex, difficult issues that have been 
little or insufficiently studied." It would then follow that 
the authors' suggested federal policy, recommending 
the elimination of TBE. is premature and unwarranted. 
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BEEHIVES AND ICEBERGS 



by 

Joseph W. Beard and Marjorie Powell 



If teachers teach as they were taught, what do we do 
r)ow? 

These comments are reflections on presentations made 
during the fourth day of the 1 981 Language Assessment 
Institute held at National College of Education on June 
17-20. 1981. Although our references are specific to the 
four presentations made on that day. responses of par- 
ticipants indicate that our comments reflected points 
made in the three prior days of presentations. 



types of tests which might be relevant, (c) recommenda- 
tions on types of instruments or procedures which 
might be used to resolve issues raised and (d) the funda- 
mental problem of a lack of a defined language variety 
or register resulting in a lack of a clear or well-defined 
language corpus of each of the specific content areas. 
The presentation raised a number of questions which, 
though specific to this one presentation, have applica- 
bility to the general set of presentations and discussions 
developed during the course of the whole institute. 



In a real sense, the four presentations made on the final 
day brought into focus many of the concepts and ideas 
that had been presented during earlier discussions at 
the Institute. Gary Keller's presentation on "Integrating 
Language Assessment with Teaching Performance in 
Subject Areas" gave perspective to the relationship be- 
tween language and the various content areas covered 
in our schools. The presentation delivered by Inez Bo- 
sworth on "Foreign Language and Bilingual Assess- 
ment: Issues and Approaches" demonstrated bridges 
between the bilingual education movement as it has 
been experienced over the past decade and the current 
affirmation of needs in the foreign language area. The 
presentation on "Language Proficiency Assessment for 
Vocational Occupational Training" delivered by Ron 
Perlman gave breadth to the understanding of work and 
career aspirations of students who come from language 
backgrounds other than English.* Stan Seidner's presen- 
tation on "Establishing Entry/Exit Criteria for Language 
Assessment" gave significant evidence of the state and 
status of bilingual education in colleges and universities 
which prepare professional staff for bilingual education 
programs. 

The Keller presentation was particularly gratifying; a 
background in science education has influenced the au- 
thor's constant search for people who are dealing with 
the issue of language usage in the various subject 
matter areas of the curriculum, particularly mathematics 
and science. It has seemed at times as though we 
expect our work using general communication language 
in bilingual education to automatically result in an un- 
derstanding of content-specific areas, in the same way 
that alchemists worked on changing lead into gold. The 
focus over the past years on general language proficien- 
cy, coupled with a concern for competence in language 
arts, reading and culture, has resulted in our not ad- 
dressing the use of language in the various subject 
areas of the curnculum. 

Dean Keller's exploration of the language needed for 
teaching in the vahous subject matter areas in a bilingu- 
al format was both well-developed and consistent in its 
description of; (a) language variety or register as ap- 
propriate for each content area, (b) a selected review of 



1 . What IS the primary goal of bilingual education ? 

If the mandates of federal and state funding define 
the primary purpose of all bilingual education as ac- 
quisition of a single language. English, then the 
issue of the range of language needed to learn in 
many subject areas becomes a moot argument. In 
this instance, the development of communication 
skills in English and the learning of basic tranfer 
skills in a home language may be all that is neces- 
sary to achieve transition to English. However, if a 
program, school district or learning environment 
has as its primary purpose or function the develop- 
ment of individuals who have full skills in speaking, 
reading and writing in at least two different lan- 
guages, then the issue of how these sIciHs-^re devel- 
oped in each of the subject matter fields, as well as 
in the breadth of human discourse, becomes a sig- 
nificant factor In the former case, the issues raised 
by Dean Keller are academic in nature and have no 
real significance in the development of program 
focus for students achieving sufficient skilf in the 
English language to continue their learning experi- 
ences in only that language. In the latter case, that 
of expanding one's language experience, exposure 
and learning opportunity in more than one language 
(English, the home language, a third or fourth 
language), the issue of content-specific language 
as described and developed by Dean Keller will 
have to be dealt with. 

2. What is the applicability of issues which Dean 
Keller discussed in terms of Spanish-English 
bilingual programs to other languages included 
among our bilingual education programs as well 
as to the political, economic and social interactions 
of the late 20th century? 

Those language groups most similar to English with 
almost the same alphabet, such as the Romance 
and the German languages, pose a much smaller 
problem in the content-specific language areas (es- 
pecially with the penchant for borrowing words be- 
tween languages such as French. Spanish. English 
and German) than do those languages with different 
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sound systems and/or different alphabets. Even 
though the words which describe the engineering 
feats of the 20th century transcend language 
groups, the basic concepts of science and math- 
ematics are expressed significantly differently 
within each language community. This difference 
necessitates a concern for how those concepts can 
be transmitted to learners who come from those dif- 
ferent language groups. 

3. \Atou/d the level of sophistication of the learner in a 
given subject matter field influence his or her need 
for the development of content-specific language 
in the hom e language ? 

It may be that the basic skills in mathematics such 
as adding, subtracting, dividing and multiplying or 
the basic processes of science, such as inquiry and 
decision making, contain a language corpus in each 
language which is fairly consistent and applicaljfele. 
This basic language may be available within the 
general language knowledge of both the teacher 
and the learner. It may be that the development of 
advanced skills in areas such as calculus, physics, 
chemistry, topography, etc., results in a more specif- 
ic language base that would create the kind of prob- 
lems and difficulties which Dean Keller addressed. 
Thus, there may only be a need to define the lan- 
guage corpus for advanced studies within a subject- 
matter art 'a. 

4. \Atould consideration of how other countries re- 
spond to language-different people have value for 
us in the United States of America ? 

For example, if we consider Japan and the develop- 
ment of personnel who speak two languages to 
serve that country's commercial efforts or the re- 
sponse to linguistically different learners coming to 
- school, would we find Japan facing a similar issue 
or have they found a way to insure greater success 
in the development, implementation and mea$ure- 
ment of language curriculum? * 

5. How much can we separate the issue of the devel- 
opment of content-specific language in a home 
language from the issue of the development of 
concepts in a home language which might then be 
applied in another language ? 

The question is whether the development qf con- 
cepts requires the use of the technical language. It 
may depend upon the teaching approach used. If 
the teacher is presenting the concepts, then techni- 
cal language may be necessary to discuss the con- 
cepts. If the teaching approach is process-oriented, 
designed to assist learners to develop an under- 
standing of how mathematical concepts or scientif- 
ic knowledge is derived, the same issue of the body 
of technicat language may not be as significant. 

6 Is there a gap between the reality of human inter- 
action using language and the ideal perspective of 
Q how language might or could be used? 
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The realities of the classroom and of the contexts 
surrounding the classroom, such as district and^ 
union hiring policies, the supply of qualified teach- 
ers, and the . procedures lur teacher certHication, 
may a^l impose constraints on the achievement of 
ideal goals in terms of teacher knowledge of and 
skill in using two languages for instruction. As cer- 
tification processes have developed over the past 
twenty years and faculties at all school levels have 
reaffirmed the principle of academic freedom, 
where are the forces for change which can result in 
effective classroom behavior and linguistic skills of • 
teachers? Will the current political perspective 
which results in an English language chauvinism 
significantly alter the concern for development of 
language in subject-matter areas? 

7. How might the development of bilingual education 
approaches and concepts be truly institutionalized 
in schools? 

Implicit in the notion of developing content area 
language in multiple subject areas and in multiple 
languages is the anticipation thi learners will 
indeed become bilingual or biliterat The develop- 
' ment of bilingual instruction in multiple content 
areas could indeed make bilingual education an. en- 
richment process fully institutionalized in the 
schools. 

8 How does bilingual education differ in its issues 
and problems from learning and teaching experi- 
ences in monolingual English programs? 

We might suggest that the issues are the same, 
that they only differ in the ^gree of difficulty and 
of significance. The issue of appropriate language 
used with consistency and reliability in monolingual 
English science and mathematics classrooms is 
similar to the issue of utilizing the appropriate lan- 
guage for instruction in Spanish, Chinese^ or any 
other language. Bilingual education highlights Such 
problems and issues, but they then need to ba ad- 
dressed outside of the political and economic fac- 
tors of the existence of bilingual education. 

9. Do differences exist in the potential of developing 
language understanding among experienced 
teachers and people involved in teacher education 
programs? 

The development of a teacher poo! during the 
teacher shortage of the I950's and the early 1960's 
resulted from and led to conditions very different 
from those which exist in 1981. The current condi- 
tions, a shortage of positions for teachers, coupled 
with growing shortages of teachers trained in 
specific subject-matter areas and willing to teach in 
specific geographic areas, such as rural areas, will 
impact the development of our schools through the 
1980's. The opportunity now exists to utilize our 
resources much more effectively in developing 
teachers whose language (and other) skills can, in 
fact, be responsive to. the needs of the students 
they serve. 
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10. Are basic language "skills necessary or appropriate 
in order for students to develop advanced learning? 

Can the language' necessary to master advanced^ 
concepts in a language be separated from basic 
communication skills in a language? The existence 
of scientists who can comprehend scientific texts 
ih a second language but who cannot carry on 
simple conversations in that language may point to 
the separability of basic and technical language 
learning. As students move from basic communica- 
tion skills to the development of language which 
can be used to describe content>based experiences 
and concepts, students may need to learn the dif- 
ference between (1) their own inherent abilities to 
communicate in whatever language is appropriate 
for the environment in which they find themselves 
and (2) their need for a more scientific discourse as- 
sociated with the content area. 

•i 

1 1 . Should bilingual education and the issues associat- 
ed with language assessment respond only to the 
existence of federal or state or extra local funding 
specifically for bilingual programs or should it 
become an inherent part of a school program? 

.What approaches may be taken to institutionalize 
further the basic concept of responding to linguisti- 
cally different learners and the development of 
multiple language skills? 

Additional questions and issues were raised by the 
other three presentations of the final day of the 
institute. In a delivery of a paper by Woody Woodford. 
Inez Bosworth presented the history of language assess- 
ment testing over the past twenty yers and gave an indi- 
cation of current and future directions and issues in the 
Jield of language proficiency testing. A numtoer of the 
; '^ problems encountered in assessing the extent to which 
English-speaking students have acquired skills in a 
second language in secondary or higher education pro- 
grams remain unresolved as we struggle to assess the 
language skills of students and teachers in bilingual 
education programs. This historical review provides a 
valuable perspective on the current problems and pro- 
posed solutions. 

In a paper co-authored with Else Hamayaa Ron Perlman 
discussed test construction and development in re- 
sponse to specific nejeds associated with vocatlonaT 
education. In some wiys, these problems echo those 
identified by Gary Keller in that they are rela ted to t he 
use of technical, rather than basic, language. However 
basic communication skills are also essential to success 
in vocational settings, as these authors point out l\\jk 
presentation also addressed the language and vocation- 
al interests of multiple ethnic groups, reminding u^ of 
the wide range of language and cultural groups for 
whom bilingual education is an important issue. 



Sfan Seidner brought closure to the series of izJoncepts 
associated with the issue of language assessment by 
describing the approaches which Colleges ar/d universi- 
ties use to determine the language proficiency of stu- 
,"denr. in bilingual education programs ana the types of 
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people involved in making decisions about levels of lan- 
guage proficiency. His survey revealed the lack of con- 
sistency in assessment methods, the multiple people in- 
volved in assessment, and the focus on general com- 
munication skills. Thte survey also indicated the limited 
extent to which bilingual teacher training programs 
have been institutionalized in colleges and universities. 
There remains a strong reliance on federal dollars and 
regulations. 

These presentations sparked several questions which 
also are applicable to previous presentations at the 
Institute. 

1 . What is the nature of the testing situation ? 

Although it may be a truism Which we tend to 
forget;- the circumstances in which the client is 
being tested (environment, number of people, 
noise, light, etc.) can have a significant effect on the 
likelihood that the person being tested will demon- 
strate the particular skill Or knowledge which the 
test is intended to measure, if the person, ih fact 
possesses the skill at a Jl^evel which the test is de- 
signed to measure. y 

2. What effect does t^e age of the people being 
tested have on their ability to demonstrate the skill 
which the test is designed to measure? 

/ ' ' X 

Tests developed/and used with young children are 
not appropriatiB in testing/language proficiency 
skills among high school students or adults and 
vice versa. One needs to designate the age and ex- 
perience ov individuals being tested in order to 
begin to deftermine the appropriateness jof tests and 
the interpretation of results. 

3. How much influence do the content and the 
formajt of a test have on results of that test? 

Agafin, the need to specify the impact of the content 
arid format of the test on expected and identified re- 
sults is a need that we recognize but one that we 
sometimes overlook in the context of discussions 
^such as those presented at the Institute. 

What are the technical issues surrounding the de- 
velopment use and interpretation of tests? 

Which of these technical issues can be anticipated 
and controlled for; which ones are unique to a par- 
ticular testing situation? 

5. What is the purpose forgiving a particular test? 

What decisions might be made from an interpreta- 
tion of the results of a given test? In language as- 
sessment in particular, the purpose for testing 
could well influence the kind of test used and steps 
necessary to develop that test 

6. What are the local, state and national level political 
issues which influence the decision-making pro- 
cess prior to. during and after the use of a test for a 
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particular purpose ? 



The development of language assessment pro- 
cesses for bilingual or foreign language education 
will not take place in a vacuum, but rather, in an en- 
vironment in which political and economftc issues 
become contributing factors td'the decision-making 
prdtess. 

Each of the papers illustrated, in varied ways, the effects 



of political issues. These issues will continue to interact 
with the technical issues to influence and shape future 
directions in language assessment. While we worry 
about the technical and political issues, we must also 
grapple with the real effects on the individuals whose 
language proficiencies are being assessed, the students 
for whom all this effort is be»ng undertaken. 



